SEQ
ID

Nearest Neighbor (BlastN vs. Genbank)
ACCESSION DESCRIPTION P VALUE

Nearest Neighbor (BlastX vs. Non-Redundant Proteins)
ACCESSION DESCRIPTION P VALUE




1 

AE000392


Escherichia coli K-12
MG1655 section 282
of 400 of the
complete genome


0.19 


1 3645960 


' kALUi^UJ'i: 

evidence=predicted by content;
1-method=genefinder;084; 1-
method_score=47.46; 1-
evidence_end; 2-
evidence=predicted by match; 2-
match_accession=SWISS-PROT:P23792; 2-

match_description=DISCONNECTED
PROTEIN.; 2-matc...


4.0


559 


Homo sapiens
(subclone 3_d1 from
P1 H25) DNA
L81774 sequence


0.19 


4001725


(AB015981) MnhA
[Staphylococcus aureus]


3.0


560 


Drosophila
melanogaster cosmid
AL02U08 clone 137E7


0.19 


4001688


(AB015718) protein kinase
[Homo sapiens]




561 


Carabus
leptoplesioides
mitochondrial DNA
for NADH
dehydrogenase
AB001510 subunit 5, partial cds


0.19 


3758855 


(Z98551) MAL3P6.11
[Plasmodium falciparum]






562 


Egernia stokesii clone
AF069696 EST1 microsatellite


0.19


(AE00U26) Amino Acid
(Branched) Transport
3328994 [Chlamydia trachomatis]


2.4 


563 


F.pringlei ppcA1
gene for

phosphoenolpyruvate
X64144 carboxylase


0.19


3242974


(AF069555) G protein-coupled
receptor p2y3 [Meleagris
gallopavo]


1A 




564 


U56897


Human
immunodeficiency
virus type 1 gag
polyprotein (gag)
gene, partial cds


0.19


2257710


J73041) resolvase-like protein
[Thiobacillus ferrooxidans]


2.3 




565 


U57975


Danio rerio Notch
homologue 3 mRNA,
complete cds


0.19


3874971


similar to NAD
dependent
epimerase/dehydratase family;
cDNA EST EMBL:C10103
comes from this gene; cDNA
EST EMBL:D66400 comes
from this gene; cDNA EST
EMBL:D70143 comes from this
gene; cDNA EST yk493hlL3
comes from


2.3 






1.8
Nearest Neighbor (BlastN vs. Genbank)


Nearest Neighbor (BlastX vs. Non-Redundant Proteins)


SEQ 
ID 


ACCESSION 


DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 






Human WD protein 






mucin - human >gi|501033 




573 


U57058 


ER10 pre-mRNA, 
partial cds 


0.19


63 1302 


(U14383) mucin [Homo 
sapiens] 


0.60 


574 


AF034460 


Penicillium thomii
internal transcribed
spacer 1, 5.8S
ribosomal RNA gene
and internal
transcribed spacer 2,
complete sequence;
and 28S ribosomal
RNA gene, partial
sequence


0.19 


114136 


AMINO-ACID 
ACETYLTRANSFERASE 
Pseudomonas aeruginosa 

acetylglutamate synthase 
[Pseudomonas aeruginosa] 


0.35 


575 


U95098 


Xenopus laevis
mitotic

phosphoprotein 44
mRNA, partial cds


0.19 


105270 


alpha-2radrenergic receptor - 
human name 'ADRA2R' [Homo 
sapiens] 


0.27 


576 


AG001475


Homo sapiens
genomic DNA, 21q
region, clone:
125H6N2


0.19 


94977 


Pseudomonas sp. (DSM 6898)
plasmid pfCB740 >gi|45867
(X66604) ORF3


0.16 


577 


M63284 


Mouse IgG receptor 
(beta-Fc-gamma-RII) 
gene, exons 9 and 10, 
clones lambda- 
Fc(3.2,93). 


0.19 


3024681 


TRANSCRIPTION
INITIATION FACTOR TFIID
135 KD SUBUNIT (TAFII-135)
(TAFII135) (TAFII-130) of
rna polymerase ii transcription
factor TFIID [Homo sapiens]


0.088 


578 


U38241 


Pseudomonas
aeruginosa orotate
phosphoribosyl
transferase (pyrE),
catabolite repression
control protein (crc)
and RNase PH (rph)
genes, complete cds


0.19 


3044086 


(AF055904) unknown
[Myxococcus xanthus]


0.052 


579 


AF039734 


Lontra longicaudis
transthyretin intron I,
partial sequence


0.19


322759 


pistil extensin-like protein
(clone pMG14) - common
tobacco (fragment) >gi|19927
(Z14015) pistil extensin like
protein [Nicotiana tabacum]


0.030 


580 


U95094 


Xenopus laevis XL- 
INCENP (XL- 
INCENP) mRNA, 
complete cds 


0.19


2147194


collagen - Paralvinella grasslei


0.002 



WO 01/02568 



PCT/US00/18374



SEQ
ID

Nearest Neighbor (BlastN vs. Genbank)
ACCESSION DESCRIPTION P VALUE

Nearest Neighbor (BlastX vs. Non-Redundant Proteins)
ACCESSION DESCRIPTION P VALUE


581 


AB004232


Drosophila 
melanogaster mRNA 
for DAD polypeptide 
complete cds 


0.19 


2498765 


PEROXISOMAL MEMBRANE
PROTEIN PEX16 lipolytica]

0.002 


582 


AF098919


Gallus gallus alpha-
globin gene domain 5
region


0.19


1086863


(U41272) T03G11.6 gene
product [Caenorhabditis
elegans]


4e-05 


583 


AE001457 


Helicobacter pylori,
strain J99 section 18
of 132 of the
complete genome


0.19


2924552


(AL022018) 1-

evidence=predicted by content;
1-method=genefinder;084; 1-
method_score=16548; 1-
evidence_end; 2-
evidence=predicted by match; 2-
match_accession=AA264666; 2-
match_description=LD0835 Up
rime LD Drosophila melanoga...


3e-05


584 


L10329 


Plasmid RP4 traE 
gene, 3' end; traD 
gene, complete cds; 
trap gene, 5' end. 


0.19 


3878117


(Z49068) mitochondrial carrier
protein


8e-07


585


AE001155


Borrelia burgdorferi
(section 41 of 70) of
the complete genome


0.19


861276


(U23739) similar to TPR
domains in e.g. yeast STI1
protein [Caenorhabditis elegans]


2e-12


586


U49979


Orf virus E10R
homolog gene, partial
cds, and DNA
polymerase gene,
complete cds


0.19


3850072


(AL033385) dna-directed rna
polymerase iii subunit
[Schizosaccharomyces pombe]


1e-15


587


U88155


Xenopus laevis
RanGTPase
activating protein


0.19


995714


(X91258) pid:e198503
[Saccharomyces cerevisiae]


4e-16


588


AF061854


Schizosaccharomyces
pombe Clr4p (clr4)
gene, complete cds


0.19


3242750


(AC005164) match to ESTs
AA731149 (NID:g2140138),
AA731908 (NID:g2752719),
AA287837 (NID:g1933519),
AA262811 (NID:g1898382),
and AA825820 (NID:g2899132)


5e-19


589


M23865


S.cerevisiae CHS2
gene encoding chitin
synthase.


0.18


<NONE>


<NONE>


<NONE>
SEQ
ID

Nearest Neighbor (BlastN vs. Genbank)
ACCESSION DESCRIPTION P VALUE

Nearest Neighbor (BlastX vs. Non-Redundant Proteins)
ACCESSION DESCRIPTION P VALUE

590 


U95094


INCENP (XL- 
INCENP) mRNA, 
complete cds 


0.18 


<NONE> 


<NONE>


<NONE>


591


AF067610


Caenorhabditis
elegans cosmid
F41A4


0.18


<NONE>


<NONE>


<NONE>


592 


AF036329


Homo sapiens
gonadotropin-
releasing hormone
precursor, second
form (GnRH-0) gene,
complete cds


0.18


<NONE>




<NONE>


593 


Z49216


H.sapiens
mitoxantrone-
resistance associated
mRNA


0.18


<NONE>


<NONE>


<NONE>


594


X02167


Torulopsis glabrata
mitochondrial DNA
for tRNA-Thr, -His
and -Glu upstream of
cytochrome b gene


0.18


<NONE>


<NONE>


<NONE>


595 


Z31561 


R.communis
(Carmencita) Scr1
mRNA for sucrose
carrier


0.18


<NONE>


<NONE>




596


L81692


Homo sapiens
(subclone 2_c9 from
P1H56) DNA
sequence


0.18


1346575


55 KD ERYTHROCYTE
MEMBRANE PROTEIN


<NONE>
8.4


597


X57310


Nocardia
lactamdurans pcbAB
and pcbC genes for
alpha-aminoadipyl-L-
cysteinyl-D-valine
synthetase and
isopenicillin N
synthase


0.18


126404


SEED LIPOXYGENASES (L-
) soybean >gi|170014 (J03211)
lipoxygenase (EC 1.13.11.12)


6.5


598


U18315


Sus scrofa
parathyroid receptor
(PTH) mRNA,
complete cds


0.18


1022323


K04647) collagen alpha-2(IV)
chain [Mus musculus]


3.8






SEQ
ID

Nearest Neighbor (BlastN vs. Genbank)
ACCESSION DESCRIPTION P VALUE

Nearest Neighbor (BlastX vs. Non-Redundant Proteins)
ACCESSION DESCRIPTION P VALUE


599 


AL010158


Plasmodium
falciparum DNA ***
SEQUENCING IN
PROGRESS ***
from contig 3-85,
complete sequence


0.18


2506816


VERSICAN CORE PROTEIN
PRECURSOR
PROTEOGLYCAN CORE
PROTEIN 2) (GLIAL
HYALURONATE-BINDING
PROTEIN) (GHAP) >gi|60851<
(U16306) chondroitin sulfate
proteoglycan versican V0 splice
variant precursor peptide


3.7


600 


AB005287


Bos taurus mRNA for
thrombospondin 1,
complete cds


0.18


2146000


U0002b protein -
Mycobacterium tuberculosis
tuberculosis]

>gi|1694863|gnl|PID|e283373
(Z83018) hypothetical protein
Rv2963c [Mycobacterium
tuberculosis]


2.9


601 


AL02U08


Drosophila
melanogaster cosmid
clone 137E7


0.18


3483032


(AL031371) hypothetical
protein SC4G2.06
[Streptomyces coelicolor]


2.9


602 


U57975 


Danio rerio Notch 
homologue 3 mRNA,
complete cds 


0.18 


85719 


collagen alpha 1(II) chain
precursor - African clawed frog


1.7


603


M30124


P.aeruginosa
autonomously 
replicating sequence. 


0.18 


3878017 


(Sn>2 1 similar to Zinc
finger, C4 type (two domains);
cDNA EST yk452f4.5 comes
from this gene; cDNA EST
EMBL:T00774 comes from this
gene receptor NHR-3
[Caenorhabditis elegans]


1.3


604 I 


X54965 ( 


3.sp alpha 5HR DNA 


0 18 f 


< 
l 

134304 


TEM CELL PROTEIN T 
:hicken >«i 162845 CX6317 1 * 1 
ransforming capacity [Gal lus 1 
tallus) j 


1.3 1 


605


U95098


Xenopus laevis
mitotic

phosphoprotein 44
mRNA, partial cds


0.18


1628403


(X98893) hTAFII68 [Homo

sapiens] splicing [Homo

sapiens]


1.3


606


U20793


Oryctolagus
cuniculus renal
sodium-dependent
phosphate transporter
type II mRNA,
complete cds.


0.18


1705984


92 KD TYPE IV
COLLAGENASE
PRECURSORIV,92JC
precursor - rat >gi|1022784
(U36476) 92-kDa type IV
collagenase [Rattus norvegicus]


1.2
Nearest Neighbor (BlastN vs. Genbank)



ID ACCESSION DESCRIPTION P VALUE



Human

cholecystokinin type
A receptor (CCK-A)
607 U23427 gene, exons 1 and 2.



608 



Rattus norvegicus
protein kinase MUK2
U49953 mRNA, complete cds



Human alpha globin
gene cluster on
chromosome 16: zeta
gene.



610 X62513



M.gallopavo gene for
metallothionein



611 X04862



Goat embryonic alpha
globin gene zeta
exons 2-3



612 M12450



Rat vitamin D
binding protein
(DBP) mRNA,
complete cds.



Mus musculus muscle
NSP-like 1 (Nspl1)
613 AF038539 mRNA, complete cds

Bacteriophage P22
right operon, orf 48,
replication genes 18
and 12, nin region
genes, ninG
phosphatase, late
control gene 23, orf
60, complete cds, late
control region, start
X78401 of lysis gene 13



614 



0.18 



0.18



615 



0.18 



0.13 



0.18



Nearest Neighbor (BlastX vs. Non-Redundant Proteins)



ACCESSION 



0.18 



0.18



0.18 



Pig mRNA for inter-
alpha-trypsin
inhibitor heavy-chain
D38754 HL complete cds 0.18



DESCRIPTION 



3261734 



(Z94752) hypothetical protein
Rv1004c



551238 



1585259 



2494740 



3297877 



1123087 



(X81847) pectate lyase I
[Erwinia carotovora]



traJ gene [Amycolatopsis
methanolica]

THkkjJrihiiCAL iMkb 



PROTEIN IN GBD 5-REGION 
(ORF4) >gi|2120954|pirp39562 
ORF4 - Alcaligenes eutrophus 
>gi|695274 (L36817) ORF4



0.97 



0.43 



0.41 



36837 androgen receptor B - human 



(AJ130783) APC2 protein [Mus
4210432 musculus]



(AJ224S6S) GNAS1 [Homo
sapiens]



(U42436) C49H3.3 gene
product [Caenorhabditis
elegans]



1397275



(U61947) C06G3J gene
product [Caenorhabditis
elegans]



0.31 



0.082 



0.038 



0.029



0.009 



7e-06 
Nearest Neighbor (BlastN vs. Genbank)



ID ACCESSION DESCRIPTION P VALUE




Rabbit mRNA for
aminopeptidase N
616 X51508 (partial)



617 X54850



S.kluyveri linear
plasmid pSKL DNA
for open reading
frames 1-10



618 L21954



619 U09355



Nearest Neighbor (BlastX vs. Non-Redundant Proteins)



ACCESSION 



0.18 



0.18 



Human peripheral 
benzodiazepine 

receptor gene, exond 0.18
Oryctolagus

cuniculus protein

phosphatase 2A1 B

gamma subunit

(skeletal muscle

isolate) mRNA,

complete cds. 0.18



DESCRIPTION
LRR47 protein - fruit fly



P VALUE



630864 



(Drosophila melanogaster)
>gi|415947 (X75760) LRR47
[Drosophila melanogaster]
^HJlHLliLAL UJKU 



6e-07 



PROTEIN C2C6.07 IN
CHROMOSOME I
>gi|2370504|gnl|PID|e339194
pombe]

>gi|3451305|gnl|PID|e1316730
(AL031324) very hypothetical
protein [Schizosaccharomyces

3183405 pombe] 2e-08

J T ALU i2bibj cDNA EST

EMBL:D70654 comes from this
gene; cDNA EST
EMBL:Z14359 comes from this
gene; cDNA EST
EMBL:D33409 comes from this
gene; cDNA EST
EMBL:D36239 comes from this
gene; cDNA EST
EMBL:Z14766 comes from this
3925211 gene... 4e-09



620 X58715



T.cruzi hsp70 mRNA
for 70 kDa heat shock
protein, partial cds



0.18



621 AF060195



622 L27235



Mus musculus
proteasome regulator
PA28 beta subunit
gene, complete cds



0.18



Methylobacterium
extorquens serine
cycle proteins



0.18 



3947877 



(AL034382) putative mitosis
and maintenance of ploidy
protein [Schizosaccharomyces
pombe]



3024081



MYOSIN LIGHT CHAIN

KINASE, SMOOTH MUSCLE
AND NON-MUSCLE
ISOZYMES (MLCK)
(CONTAINS: TELOKIN)



8e-11



9e-12



861276 



2688949 



(U28739) similar to TPR
domains in e.g. yeast STI1
protein [Caenorhabditis elegans]



1e-14



(AF027208) AC133 antigen
[Homo sapiens]



1e-14






SEQ
ID ACCESSION



Nearest Neighbor (BlastN vs. Genbank) 



623 AF006573



624 AF001782



625 AJ223364



DESCRIPTION 



Drosophila virilis
maltase 1 (Mav1) and
maltase 2 (Mav2)
genes, complete cds



626 J03059



627 AB008860



628 AF027174



629 AF059650



Staphylococcus 
aureus strain SA502A 
AgrB 

Homo sapiens germ 
line DNA upstream of 
kappa locus
Human



glucocerebrosidase
(GCB) gene, 
complete cds 



Fugu rubripes Cal2 
gene for pheromone 
receptor, complete 
cds 



Nearest Neighbor (BlastX vs. Non-Redundant Proteins)



P VALUE ACCESSION



DESCRIPTION 



0.18 



2500558



0.17



0.17 



0.17 



Arabidopsis thaliana 
cellulose synthase 
catalytic subunit (Ath- 
B) mRNA. complete 
cds 



0.17 



Homo sapiens histone 
deacetylase 3
(HDAC3) gene, 
complete cds 



0.17 



0.17 



<NONE> 



<NONE> 



PUTATIVE RIBONUCLEASE
III (RNASE III)

>gi|3876420|gnl|PID|e1346063
(Z81070) similar to ribonuclease
[Caenorhabditis elegans]



2e-23 



<NONE> 



<NONE> 



<NONE> 



<NONE> 
1AHJU4MJU) LJKAttP [Homo" 



2198849 



sapiens] >gi|2665826

(AF035771) Na+/H+ exchanger

regulatory factor 2 [Homo

sapiens] factor 2 [Homo
sapiens]

>gi|3618353|gnl|PID|d1034182
exchanger isoform A3 [Homo
sapiens]



7.8 



539355 



SCD25 protein (version 1) -



7.5



482118



hypothetical protein C15H7.1
Caenorhabditis elegans



4.5 
Nearest Neighbor (BlastN vs. Genbank)



SEQ

ID ACCESSION DESCRIPTION



630 AF059650



631 X5S065



Homo sapiens histone

deacetylase 3

(HDAC3) gene,
complete cds

Chinese hamster
metallothionein II
gene



Nearest Neighbor (BlastX vs. Non-Redundant Proteins)



P VALUE ACCESSION



0.17



0.17



632 U15280



Rattus norvegicus
oxytocin receptor
(OTR) gene, exon 3
and complete cds



633 X04862



634 AL010222



635 X60111



Goat embryonic alpha
globin gene zeta

exons 2-3

Plasmodium
falciparum DNA ***
SEQUENCING IN
PROGRESS ***
from contig 4-09,
complete sequence

H.sapiens mRNA for
MRP-1

Orf virus E10R



636 U49979



homolog gene, partial
cds, and DNA
polymerase gene,
complete cds



465932 



3687237 



DESCRIPTION P VALUE



HYPOTHETICAL 03,2 IQ>



PROTEIN F58A4.11 IN
CHROMOSOME III
>gi|3874287|gnl|PID|e1344088
EST EMBL:C12577 comes
from this gene; cDNA EST
yk227e7.5 comes from this
gene; cDNA EST yk303d1.5
comes from this gene; cDNA
EST yk314c12.5 comes from
this gene; cDNA ...
EMBL:C11886 comes from this
gene; cDNA EST
EMBL:C12577 comes from this
gene; cDNA EST yk227e7.5
comes from this gene; cDNA
EST yk303d1.5 comes from this
gene; cDNA EST yk314c12.5
comes from this gene; cDNA



0.17 



542565 



(AC005169) putative Cys3His
zinc-finger protein

cyclin E type II - fruit fly

(Drosophila melanogaster)
>gi|429168 (X75027)
Drosophila cyclin E type II
[Drosophila melanogaster]



0.17 



86837 



4.4 



1.5 



0.45 



androgen receptor B - human 0.080



0.17 



0.17 



1177322 
3237306 



0.17 



3850072 



(X95466) CPG2 protein [Rattus
norvegicus]

>gi|1588593|prf||2208498A
plasticity-related gene [Rattus
norvegicus] 7e-07

(U92715) breast cancer
antiestrogen resistance 3 protein 3e-09



(AL033385) dna-directed rna
polymerase iii subunit
[Schizosaccharomyces pombe]
Nearest Neighbor (BlastN vs. Genbank)


Nearest Neighbor (BlastX vs. Non-Redundant Proteins)


SEQ
ID


ACCESSION 


DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 
















637 


AF006573 


Drosophila virilis 
maltase 1 (Mav1) and
maltase 2 (Mav2) 
genes, complete cds 


0.17 


2500558 


PUTATIVE RIBONUCLEASE

III (RNASE III)

>gi|3876420|gnl|PID|e1346063
(Z81070) similar to ribonuclease
[Caenorhabditis elegans]


2e-29 


638 


AE001141 


Borrelia burgdorferi 
(section 27 of 70) of 
the complete genome 


0.16 


1850327 


(U52370) fertilin beta [Homo
sapiens]


2.3


639


M72980


Anthonomus grandis
vitellogenin gene
(VTG), complete cds.


0.12


3242750 


(AC005164) match to ESTs
AA731149 (NID:g2140138),
AA731908 (NID:g2752719),
AA287837 (NID:g1933519),
AA262811 (NID:g1898382),
and AA825820 (NID:g2899132)


2e-56 


640 


AF023532 


Simulium vittatum 
ATPase 6 gene, 
mitochondrial gene 
encoding 
mitochondrial 
protein, partial cds 


0.11 


<NONE> 


<NONE> 


<NONE> 


641 


U76523 


Sambucus nigra lectin 
precursor mRNA, 
complete cds 


0.10 


3482965 


(AL031369) putative protein


0.49 


642 


AJ001596 


Danio rerio mRNA 
for opioid receptor 
homologue 


0.099 


1706694 


LANOSTEROL SYNTHASE

(EC 5.4.99.7) - fission yeast

(Schizosaccharomyces pombe)


2.3 


643 


U26341 


Oryctolagus 
cuniculus Na and Cl
dependent betaine 
transporter mRNA, 
complete cds. 


0.099 


2645804 


(AF033381) betaine
homocysteine methyl transferase
[Mus musculus]


0.59


644 


M11633


Bacteriophage Cp-5
(S. pneumoniae) 3'
inverted terminal
repeat.


0.082


2314695 


(AE000649) type IIS restriction 
enzyme R and M protein 


43 


645 


X74103 


Streptomyces sp. 
gene for alkaline 
serine protease I 


0.073 


1314734 


(U54641) 220 kDa silk protein 
[Chironomus thummi]


6.3 








SEQ
ID

Nearest Neighbor (BlastN vs. Genbank)
ACCESSION DESCRIPTION P VALUE

Nearest Neighbor (BlastX vs. Non-Redundant Proteins)
ACCESSION DESCRIPTION P VALUE


658 


AB014546


Homo sapiens mRNA
for KIAA0646
protein, complete cds


0.066


1082461


homeotic protein HB9 - human 


0.38 


659 


AF104156


Rattus exulans isolate
huahine30
mitochondrial D-
loop, partial sequence


0.066


1002380


(U24189) RRM-type RNA
binding protein [Caenorhabditis
elegans]


0.29


660 


X97581


M.musculus mRNA
for spalt transcription
factor


0.066


(AL035075) putative myosin
4107313 heavy chain


0.28


661 


D85378


Human clone H20 N-
acetylglucosaminyltransferase
III DNA,
exon 2


0.066


2114473


(U96963) p140mDia [Mus
musculus]


0.22


662 


M97561


Human (clone

LA179) chromosome

21 sequence,


0.065


<NONE>


<NONE>


<NONE>


663 


AE001373 


Plasmodium
falciparum
chromosome 2,
section 10 of 73 of
the complete
sequence


0.065


<NONE>


<NONE>


<NONE>


664


S75479


growth hormone
receptor, growth
hormone binding
protein {GHR/BP
gene} (mice, C57
black/6, Genomic,
179 nt, segment 8 of

'°* L


0.065


<NONE> 


<NONE> 


<NONE> 


665


AF032922


Homo sapiens

j IIUIAIII «t uiuuinE I

protein UNC-18c
(UNC-18c) mRNA,
complete cds


0.065


3061308


(AB006074) topoisomerase III
[Mus musculus]


0.82


666


S80986


svp[40]=svp-related
nuclear
receptor/retinoid
signaling modulator
[zebrafishes, mRNA,
S76 nt]


0.065


132628S


(U58734) weak similarity to
ankyrin G [Caenorhabditis
elegans]


0.12






SEQ
ID

Nearest Neighbor (BlastN vs. Genbank)
ACCESSION DESCRIPTION P VALUE

Nearest Neighbor (BlastX vs. Non-Redundant Proteins)
ACCESSION DESCRIPTION P VALUE


667 


X59552


vj.uuiXj CAUCUS mRNA

for ventricular myosin
heavy chain


0.065


PROTEIN IN AMD1-RAD52
INTERGENIC REGION
>gi|1077180|pir||S49745
probable membrane protein
YML034w - yeast
(Saccharomyces cerevisiae)
>gi|575685 (Z46659) unknown
orf, len: 656, CAI: 0.13
?497098 [Saccharomyces cerevisiae]


0.014


668 


M72980


Anthonomus grandis
vitellogenin gene
(VTG), complete cds.


0.065


3242750


(AC005164) match to ESTs
AA731149 (NID:g2140138),
AA731908 (NID:g2752719),
AA287837 (NID:g1933519),
AA262811 (NID:g1898382),
and AA825820 (NID:g2899132)


5e-33


669 


AB014546 


Homo sapiens mRNA

for KIAA0646

protein, complete cds


0.064


<NONE>


<NONE>


<NONE>


670 


M30039 


Sheeppox virus strain
KS-1 ORF HM1
gene, partial cds;
ORF HM2 and ORF
HM3 genes, complete
cds; and ORF HM4
gene, partial cds


0.064


<NONE>


<NONE>


<NONE>


671


Z68013


Caenorhabditis
elegans cosmid

sequence

[Caenorhabditis

elegans]


0.064


<NONE>


<NONE>


<NONE>


672


AF041332


Bodo saltans
unknown mRNA,
kinetoplast gene
encoding kinetoplast
protein, complete cds


0.064


<NONE>


<NONE>


<NONE>


673


J00451


Mouse germline IgG-
chain gene, D-Jf-C
region, and switch
region.


0.064


<NONE>


<NONE>


<NONE>






SEQ
ID

Nearest Neighbor (BlastN vs. Genbank)
ACCESSION DESCRIPTION P VALUE

Nearest Neighbor (BlastX vs. Non-Redundant Proteins)
ACCESSION DESCRIPTION P VALUE


674 


U41289 


Dictyostelium
discoideum K7
kinesin-like protein
mRNA, complete cds


0.064


3482972


(AL031369) putative protein


9.3


675 


M37395 


L.lactis (strain SK11)
proteinase plasmid
pSK111 DNA,
complete cds.


0.064


993019


(XS7246) alternative start codon
[Pseudorabies virus]


9.2


676


Z15030


H.sapiens gene for
ventricular myosin
light chain 2 >::
gb|L01652|HUMVMLC
Human
ventricular myosin
light chain 2 gene,
seven exons.


0.064


730343


PROLACTIN RECEPTOR

PRECURSOR (PRL-R) mouse
>gi|220576|gnl|PID|d1001535
(D10214) prolactin receptor
precursor [Mus musculus]
>gi|zyj//u (L148H) prolactin
receptor [Mus musculus]
>gi|347842 (L13593) prolactin
receptor [Mus musculus]
receptor;ISOTYPE=long form
[Mus musculus]


9.1


i 

677 


Z12021


G.max gene for
catalase


0.064


2498711


ORIGIN RECOGNITION
COMPLEX PROTEIN,
SUBUNIT 2 >gi|1185461
(U38472) essential ORC2-
related fission replication factor
Orp2 [Schizosaccharomyces
pombe]


5.3


678


L05668


Entamoeba histolytica
protein

serine/threonine
kinase (pstk1) gene,
complete cds.


0.064


733140


(U22453) carboxypeptidase
[Simulium vittatum]


5.3



SEQ
ID

Nearest Neighbor (BlastN vs. Genbank)
ACCESSION DESCRIPTION P VALUE

Nearest Neighbor (BlastX vs. Non-Redundant Proteins)
ACCESSION DESCRIPTION P VALUE


679 


U50715 


Mus musculus alpha-
galactosidase A gene, 
complete cds 


0.064 


125398 


HYGROMYCIN-B KINASE 
(HYGROMYCIN B 
PHOSPHOTRANSFERASE) 
(APH(7''))

>gi|66885|pir||WGSMHH
hygromycin B

phosphotransferase (EC 2.7.1.-)
Streptomyces hygroscopicus
>gi|581682 (X03615) pot. hyg
protein [Streptomyces
hygroscopicus]
phosphotransferase [synthetic
construct] >gi|2739064 cloning
vector] >gi|2739068 
(AF025747) hygromycin B 
phosphotransferase [unidentified 
cloning vector] 


2.3 


680 


Z28182 


S.cerevisiae
chromosome XI
reading frame ORF
YKL182w


0.064


1079035


Om(2D) protein - fruit fly
(Drosophila ananassae)
>gi|443770|gnl|PID|d1006095
(D26553) ORF


1.8


681 


M29917 


Human ornithine
aminotransferase
gene, exon I,


0.064 


2317934 


(U97553) unknown [murine 
herpesvirus 68] 


1.4 


682 


AB020709 


Homo sapiens mRNA 
for KIAA0902 
protein, complete cds 


0.064


861404


(U29154) T07F12.3 gene
product [Caenorhabditis
elegans]


0.47


683 


AB014546


Homo sapiens mRNA
for KIAA0646
protein, complete cds


0.064


1708118


HOMEOBOX PROTEIN HB9
>gi|507425


0.35 


684 


AB010427 


Homo sapiens mRNA
for NORM, complete
cds


0.064


2388676


(AF015539) precollagen P
[Mytilus edulis]
Mytilus edulis] 


0.018 


685 


< 
i 
1 
1 

U34774


Orf virus ankyrin-like
repeat protein, F11L
homolog, and F12L
homolog genes,
complete cds,


0.064


731668


SSF1 PROTEIN
>gi|626624|pir||S46700 SSF1
protein - yeast (Saccharomyces
cerevisiae)


1e-05


686 


J 
r 
r 

AF022861


Mus musculus
neuropilin-2(a5)
mRNA, alternatively
spliced, complete cds


0.064


4091978


(AF073359) benzaldehyde
dehydrogenase [Pseudomonas
sp. DJ77]


1e-05






SEQ
ID

Nearest Neighbor (BlastN vs. Genbank)
ACCESSION DESCRIPTION P VALUE

Nearest Neighbor (BlastX vs. Non-Redundant Proteins)
ACCESSION DESCRIPTION P VALUE

E.coli genomic DNA


704 


U9U372


Kohara clone
#419(54.7-55.1 min.)


0.063


2498198


CYTOCHROME B561
(CYTOCHROME B-561)


3e-19


705 


M25528 


M.crystallinum
ferredoxin-NADP+
reductase (fnrA)
mRNA, complete cds


0.062


<NONE>


<NONE> 


<NONE> 


706 


U45256 


Strongyloides ratti 
microsatellite B DNA 


0.062


<NONE>




<NONE> 


707 


U95102 


Xenopus laevis
mitotic

phosphoprotein 90
mRNA, complete cds


0.062


<NONE>


<NONE> 


<NONE> 


708 


Homo sapiens
TEL/AML1 fusion
AF044317 gene, partial sequence


0.062


<NONE>


<NONE>


<NONE> 


709 


|i 

Y 
\i 
I 

Z73975


Caenorhabditis
elegans cosmid
r06ES, complete
sequence
[Caenorhabditis
elegans]


0.062


3108187 


sapiens] 


2.9 


710 


I 
\h 

X54232


Human mRNA for
heparan sulfate
proteoglycan


0.062


1076741


chitinase (EC 3.2.1.14)
precursor - rice precursor - rice
>gi|S07955 (X87109) chitinase
[Oryza sativa]


0.59


711 


r 

fi 

X03073


Bovine retinal mRNA

transducin beta-
subunit


0.062


477578


sialidase - Actinomyces viscosus
>gi|141852


0.087 


712 


D.melanogaster Jun
Y12573 and 14-3-3 zeta gene


0.062


3879551


(Z70756) similar to collagen


0.073 


713 


Bombus terrestris
mitochondrial
cytochrome oxidase I,
L26573 partial cds.


0.062


1684959


(U20600) NADH
dehydrogenase subunit [Vanda
lmdlara]


0.039 



Nearest Neighbor (BlastN vs. Genbank)

SEQ ID ACCESSION DESCRIPTION P VALUE



Nearest Neighbor (BlastX vs. Non-Redundant Proteins)



ACCESSION DESCRIPTION P VALUE

~~ jAMlNUrfaH " 1 ' 1 RVALUE 



Human ladinin (LAD)
714 U58994 gene, complete cds 0.062



28H07S



AMINOPEPTIDASE)
(ARGININE
AMINOPEPTIDASE)
(CYTOSOL

AMINOPEPTIDASE IV) (AP-
B) >gi|2039143 (U61696)
aminopeptidase B [Rattus
norvegicus]



Homo sapiens mRNA
for KIAA0653
715 AB014553 protein, partial cds



0.062



1326350 



716 L16898



Mus musculus

collagen alpha 1 type

XVIII mRNA, 5' end,



0.062 



717 X99343



M.tuberculosis
guaA/B & choD
genes



1723657 



(U58748) similar to potential

transmembrane domains in S.

cerevisiae nuclear division
RFT1 protein (SP:P38206)

HYPOTHETICAL 38.5 KD

PROTEIN IN ERV1-GLS2
INTERGENIC REGION
Nipl32587|pir||S64322

probable membrane protein
YGR031w - yeast

(Saccharomyces cerevisiae)
>gi|1323010|gnl|PID|e243277
(Z72816) ORF YGR031w
[Saccharomyces cerevisiae]



0.062 



Homo sapiens MAD-
related gene SMAD7
(SMAD7) mRNA,
718 AF010193 complete cds 0.061



3873807



(Z49907) B0491.1
[Caenorhabditis elegans]



<NONE> 



9e-06 



5e-10 



1e-14



2e-19



719


L10182


Myrmeleon sp. 18S
ribosomal RNA.


0.061


<NONE>


<NONE>


<NONE>


720


Y14723


Choanomphalus
incertus
mitochondrial
cytochrome c oxidase
subunit I gene, partial


0.061


<NONE>


<NONE>


<NONE>


721


L27840


Bovine respiratory
syncytial virus
nucleoprotein mRNA,
complete cds.


0.061


542955


nucleoporin p62 - human


8.6






Nearest Neighbor (BlastN vs. Genbank)



ID ACCESSION DESCRIPTION



722 U95094



Xenopus laevis XL-
INCENP (XL-
INCENP) mRNA,
complete cds



723 U95098



724 U26463



Xenopus laevis
mitotic

phosphoprotein 44
mRNA, partial cds
Sporidiobolus
salmonicolor
NADPH-dependent
aldehyde reductase
gene, complete cds



Nearest Neighbor (BlastX vs. Non-Redundant Proteins)



P VALUE ACCESSION



DESCRIPTION



0.061 



494454 



Sus scrofa

scrofa Sus Scrofa
>gi|1421210|pdb|1PCP| Porcine
Spasmolytic Protein (Psp) (Nmr,
19 Structures) Spasmolytic
Polypeptide

>gi|1633061|pdb|2PSP|B Chain
B, Porcine Pancreatic
Spasmolytic Polypeptide



2.9 



0.061 



3845272 



(AE001417) hypothetical
protein [Plasmodium
falciparum]



1.3



0.061



1710288



(U79302) unknown [Homo
sapiens]



0.44 



725 AF035443



Xenopus laevis wee1
homolog mRNA,
complete cds
Caenorhabditis



726 Z48584



elegans cosmid

CI 321. complete
sequence
[Caenorhabditis
elegans]



0.061 



3979720 



0.061 



3183491 



EMBL:D33048 comes from this
gene; cDNA EST
EMBL:D35780 comes from this
gene; cDNA EST yk442c6.3
comes from this gene; cDNA
EST yk442c6.5 comes from this
gene; cDNA EST yk398f6.3
comes from this gene; cDNA
E...

>gi|3979816|gnl|PID|e1358315
EST EMBL:D35780 comes
from this gene; cDNA EST
yk442c6.3 comes from this
gene; cDNA EST yk442c6.5
comes from this gene; cDNA
EST yk398f6.3 comes from this
gene; cDNA E...

?i vi>orracAL sis job — 

PROTEIN C27F2.7 IN
CHROMOSOME III
>gi|1065510 (U40419) C27F2.7
gene product [Caenorhabditis
elegans] 3e-11



2e-04 






Nearest Neighbor (BlastN vs. Genbank) 



Nearest Neighbor (BlastX vs. Non-Redundant Proteins)



SEQ

ID ACCESSION



DESCRIPTION



727 X61489



Zea mays pep gene
for (C3 type)
phosphoenolpyruvate
carboxylase



P VALUE 



ACCESSION 



DESCRIPTION
■^IIL'ALJIOKD 



728 



Drosophila
melanogaster
Windbeutel (wind)
AF025408 gene, complete cds



729 | AB012106



734 



Brassica rapa mRNA 
for SRK45, complete
cds 



730 | AF027174



Arabidopsis thaliana
cellulose synthase
catalytic subunit (Ath-
B) mRNA, complete
cds



731 | Y08682



H.sapiens mRNA for
carnitine 
palmitoyltransferase I 
type I 



732 | U95094 



Xenopus laevis XL-
INCENP (XL-
INCENP) mRNA,
complete cds



733 | AF064030 



Helianthus tuberosus 
lectin 2 mRNA,
complete cds



Mus musculus
Pontin52 mRNA,
AF100694 complete cds



0.061 



2496887 



PROTEIN C09F5.2 IN
CHROMOSOME III
>gi|732538 (U22832) C09F5.2
gene product [Caenorhabditis
elegans]



0.061 



3703295 



(AC0057S3) R33083_1 [Homo
sapiens] 



0.060 



<NONE> 



<NONE> 



0.060 



<NONE> 



<NONE> 



0.060 



3319446 



(AF077541) contains similarity 
to class*! aminoacyl-tRNA

synthetases [Caenorhabditis
elegans]



P VALUE



ie-15



2e-60



<NONE> 



<NONE>



8.1 



(D73016) TRAE [Enterococcus
0.060 | 1041119 faecalis]



0.060 



632209 



regulatory protein Rex - primate 
T-lymphotropic virus PTLV-L 
(fragment) 



0.060 



3098348 



(AF037401) neuropeptide 
Y/peptide YY receptor Yc 
[Danio rerio]



8.1 



3.7 



735 



737 



Xenopus laevis 
mitotic 

phosphoprotein 90
U95102 mRNA, complete cds



736 | U76523



Sambucus nigra lectin 
precursor mRNA 
complete cds 



Human nuclear pore 
complex-associated
U69668 protein TPR



0.060



125978 



LAR PROTEIN PRECURSOR
(LEUKOCYTE ANTIGEN
RELATED)
>gi|70146|pir||TPHULK
leukocyte antigen-related
protein precursor - human
>gi|34267 sapiens]



0.060 



2055394 



0.060 



4127354 



(U87306) transmembrane 
receptor UNC5H2 [Rattus 
norvegicus] 



(Y14063) ChT1 thymocyte
antigen [Gallus gallus] 



1.2 



0.32 



9c04 



SEQ
ID | ACCESSION



748 | AJ005813



749 | Y16828



750 | AF012899



751 | AF074385



752 | U76523



M92069



754 | S78516



DESCRIPTION 



mRNA for
neoxanthin cleavage
enzyme



Nearest Neighbor (BlastX vs. Non-Redundant Proteins)



P VALUE | ACCESSION



Lagopus lagopus 
genomic 
microsatellite
sequence, LLST4 



0.056 



Sambucus nigra 
ribosome inactivating
protein precursor
mRNA, complete cds | 0.055



Sambucus nigra
hevein-like protein
mRNA, complete cds | 0.055



Sambucus nigra lectin
precursor mRNA, 
complete cds 



0.035 



Human retrovirus- like 
sequenccMSOleucine cj 0.034 
GlL=ankyrin-like 
repeat [orf virus OV, 
NZ2, Genomic. 1608 
Ml I 0.033 



Chicken myosin 
alkali light chain 







mRNA, complete cds, 




755 


M15646 


clone pFi. 


; Q,V27 






Arabidopsis thaliana
cellulose synthase
catalytic subunit (Ath-
B) mRNA, complete
cds



756 | AF027174



0.025



DESCRIPTION 



P VALUE



<NONE> 



<NONE> 



137339 



<NONE> 



2804465 



3334221 



<NONE> 



(AE001299) hypothetical 
protein [Chlamydia trachomatis] | 4.3



<NONE> 



69 KD PROTEIN
>gi|94375|pir||S19150
hypothetical protein, 69K -
turnip yellow mosaic virus



<NONE>



<NONE> 



<NONE>



<NONE> 



<NONE> 



(AF043700) contains similarity
to human RNA-binding protein

FUS/TLS (SW:Q23009)
[Caenorhabditis elegans] | 0.15

hydroxyphenylpyruvate
DIOXYGENASE 4-
hydroxyphenylpyruvate
dioxygenase [Mycosphaerella
graminicola] | 6e-17



(Z9604S) predicted using
Genefinder



5.0 






ID | ACCESSION | DESCRIPTION



AJ002291 



Streptococcus
pneumoniae pbp1b
gene, complete



758 | X79104



C.botulinum (NCTC
7272 type A) HA-33
and P-21 genes.



Xenopus laevis 
mitotic 

phosphoprotein 90
U95102 mRNA, complete cds



P VALUE 



ACCESSION



DESCRIPTION



0.025



phosphate pyrophosphokinase; 
cDNA EST EMBL:D73173
comes from this gene; cDNA
EST EMBL:D70909 comes

from this gene; cDNA EST
EMBL:D73449 comes from this

gene; cDNA EST

EMBL:D76167 comes from this



2648615 



(AE000970) tungsten
formylmethanofuran

dehydrogenase, subunit B (fwdB-
2) [Archaeoglobus fulgidus]



(D83785) expressed 

ubiquitously; product similar to 
D.melanogaster mam protein.
[Homo sapiens]



760 | U36197



Chlamydomonas
reinhardtii cobalamin 
independent 
methionine synthase 
mRNA, complete cds 



761 



Macaca mulatta 
(clone MMVA63) T 
cell receptor alpha
(TCR A) mRNA, 
L38865 | partial cds. 



762 | AF035948



Mus musculus insulin
receptor substrate-3



763 | X98890



S.tuberosum mRNA
for inorganic
phosphate
transporter, StPT1



PEROXISOME
PROLIFERATOR
ACTIVATED RECEPTOR
GAMMA (PPAR-GAMMA)
>gi|283Sl8|pir||C422i4
peroxisome proliferator-
activated receptor gamma chain
African clawed frog >gi|214668
(M84163) peroxisome
proliferator activated receptor
gamma [Xenopus laevis]



1.7 



4.7 



0.023



2500587 



0.023 



110072 



SPLICEOSOME 
ASSOCIATED PROTEIN 49 
spliceosome- associated protein 
SAP-49 - human >gi|556217



proline-rich protein MP4 -
mouse >gi|53182



0.42



<NONE> 



0.40 






Nearest Neighbor (BlastN vs. Genbank)
SEQ

ID | ACCESSION



764 | X91212



765 | AC004498



766 | U07083



767 | X98890



768 | X56488



769 



770 



774 



DESCRIPTION 



P VALUE 



ACCESSION 



L.esculentum mRNA
for HD-ZIP protein | 0.022



Homo sapiens
chromosome 5, P1
clone 1209CI (LBNL
H104), complete
sequence [Homo
sapiens]



0.022 



Human prostatic acid
phosphatase (ACPP)

gene, exon 1 | 0.022

S. tuberosum mRNA
for inorganic
phosphate
transporter, StPT1 | 0.022
L.esculentum LAT59
gene 5'flanking
region, expressed
during pollen

maturation | 0.022



M34651 
X66727 



771 | U95102 



772 | U95098



773 | X71932 



Pseudorabies virus 
with upstream and 
downstream
sequences.
P.taeda gene for 
protochlorophyllide 
reductase 



Xenopus laevis
mitotic

phosphoprotein 90
mRNA, complete cds



Xenopus laevis 
mitotic 

phosphoprotein 44 
mRNA, partial cds 



X87369 



H.sapiens XB gene
for tenascin-X, intron
14



C.perfringens nanH
gene & ORF1, 2, 3 & 4



0.022 



0.022 



0.022 



0.022 



0.022 



DESCRIPTION 



<NONE> 



0.022 



<NONE> 



<NONE> 



<NONE> 



<NONE> 



<NONE>



3878517 



<NONE> 



<NONE> 



<NONE> 



<NONE> 



<NONE> 



1854452 



3021699 



627059 



2062407 



<NONE> 

(Z92806) K10G4.4 
[Caenorhabditis elegans]



<NONE>



<NONE>



(D89501) similar to salivary 
proline-rich protein P-B [Homo 
sapiens] 



<NONE> 



<NONE>



4.3 



4.3 



(AB005298) BAI 2 [Homo 
sapiens]



liver stage antigen LSA-1 -
Plasmodium falciparum
>gi|9916 (X56203) liver stage
antigen



(U78975) poly(ADP-ribose)
glycohydrolase [Bos taurus] 



0.64 



0.058 






Nearest Neighbor (BlastN vs. Genbank)



SEQ

ID | ACCESSION | DESCRIPTION



775 | Y14971



776 | AF003133



777 | AF003133



Nearest Neighbor (BlastX vs. Non-Redundant Proteins)



Gallus gallus mRNA
for K60 protein



Caenorhabditis
elegans cosmid
T21E3



Caenorhabditis
elegans cosmid
T21E3



778 | U57645



U67570 



Human helix-loop-
helix proteins Id-1
(ID-1) and Id-1' (ID-
1) genes, complete
cds



Methanococcus
jannaschii section 112
of 150 of the 
complete genome 



P VALUE | ACCESSION



0.022



134091



DESCRIPTION



P VALUE



0.022 



1709997 



KD (U1 SNRNP 70 KD)
>gi|85864|pir||S02016 U1
snRNP 70K protein - African
clawed frog >gi|65179
(X12430) U1 70K [Xenopus
laevis]

DNA repair protein

RAD18 >gi|1150622 protein
rad18 [Schizosaccharomyces
pombe]



0.032 



0.022 



1709997 



DNA REPAIR PROTEIN
RAD18 >gi|1150622 protein
rad18 [Schizosaccharomyces
pombe]



2e-08 



2e-08 



0.021 



<NONE>



<NONE> 



<NONE>



0.021 



<NONE> 



<NONE> 



780 | L01584



Trypanosoma cruzi
calcium-binding 
protein (CUB2.8) 
gene, complete cds. 



0.021 



<NONE> 



<NONE> 



<NONE> 



781 | L04787



Borrelia hermsii outer 
membrane lipoprotein 



0.021



782 | U95094 



Xenopus laevis XL- 
INCENP (XL-
INCENP) mRNA,
complete cds 



<NONE> 



<NONE> 



<NONE> 



0.021 



783 



L36890 



Saccharomyces 
cerevisiae 
mitochondrion 
transfer RNA Thrl 
(tRNA-Thr) gene; 
transfer RNA-Val 
(tRNA-Val) gene; 
Oxi2 gene, complete 
cds; ORF2 and origin 
replication (ori5). 



<NONE> 



<NONE> 



<NONE> 



0.021 



<NONE> 






Nearest Neighbor (BlastN vs. Genbank)



Nearest Neighbor (BlastX vs. Non-Redundant Proteins)







Nearest Neighbor (BlastN vs. Genbank) | Nearest Neighbor (BlastX vs. Non-Redundant Proteins)
SEQ ID | ACCESSION | DESCRIPTION | P VALUE | ACCESSION | DESCRIPTION | P VALUE
804 | X57689 | Rabbit mRNA for calcium channel BI-i (lambda CBP109 one CB10U | 0.021 | 2959370 | (AL022117) hypothetical protein | Ic^lO
805 | U95102 | Xenopus laevis mitotic phosphoprotein 90 mRNA, complete cds | 0.021 | 1109830 | (U41534) coded for by C. elegans cDNA CEESI42F; Similar to helicases of SNF2/RAD54 family. [Caenorhabditis elegans] | 5c-U
806 | X77753 | H.sapiens TROP-2 gene | 0.021 | 1723657 | HkFUlHLULALiH.^ KiJ PROTEIN IN ERV1-GLS2 INTERGENIC REGION >gi|2132587|pir||S64322 probable membrane protein YGR031w - yeast (Saccharomyces cerevisiae) >gi|1323010|gnl|PID|e243277 [Saccharomyces cerevisiae] | Sell
807 | X98890 | S.tuberosum mRNA for inorganic phosphate transporter, StPT1 | 0.021 | 2137872 | zinc finger protein PZF - mouse | 2e-19
808 | AF027173 | Arabidopsis thaliana cellulose synthase catalytic subunit (Ath-A) mRNA, complete cds | 0.020 | <NONE> | <NONE> | <NONE>
809 | AJ224935 | Homo sapiens Promoter Region and PCK2 gene | 0.020 | <NONE> | <NONE> | <NONE>
810 | U76524 | Sambucus nigra ribosome inactivating protein precursor mRNA, complete cds | 0.020 | <NONE> | <NONE> | <NONE>
811 | X99941 | A.thaliana GBF1 gene | 0.020 | <NONE> | <NONE> | <NONE>
812 | X65138 | M.musculus mRNA for tyrosine kinase >gb|S57168|S57168 eck=Eph-related receptor protein tyrosine kinase [mice, mRNA, 4242 nt] | 0.020 | <NONE> | <NONE> | <NONE>






Nearest Neighbor (BlastN vs. Genbank) | Nearest Neighbor (BlastX vs. Non-Redundant Proteins)
SEQ ID | ACCESSION | DESCRIPTION | P VALUE | ACCESSION | DESCRIPTION | P VALUE
813 | L04787 | Borrelia hermsii outer membrane lipoprotein | 0.020 | <NONE> | <NONE> | <NONE>
814 | AJ223633 | Enterococcus faecium genes encoding enterocin L50A and enterocin L50B plus 5' and 3' flanking regions | 0.020 | <NONE> | <NONE> | <NONE>
815 | AB012106 | Brassica rapa mRNA for SRK45, complete cds | 0.020 | <NONE> | <NONE> | <NONE>
816 | AE001539 | Helicobacter pylori, strain J99 section 100 of 132 of the complete genome | 0.020 | 1172292 | (L11895) transmembrane protein [Saccharomyces cerevisiae] | 8.4
817 | AF074386 | Sambucus nigra hevein-like protein mRNA, complete cds | 0.020 | 94173 | pol polyprotein - Chinese hamster intracisternal A-particle CHIAP34 | 8.0
818 | M55264 | Herpesvirus saimiri dihydrofolate reductase (DHFR) and snRNA (HSUR) genes, complete cds. | 0.020 | 2924250 | (Z98745) dJ29K1.2 [Homo sapiens] | 6.5
819 | AF052163 | Homo sapiens clone 24456 mRNA sequence | 0.020 | 1706288 | D(4) DOPAMINE RECEPTOR (D(2C) DOPAMINE RECEPTOR) >gi|2I194S2|pir||I49246 D4 dopamine receptor - mouse >gi|758427 (U19880) D4 dopamine receptor [Mus musculus] >gi|1095539|prf||2109259A dopamine D4 receptor [Mus musculus] | 4.9
820 | AF074387 | Sambucus nigra hevein-like protein mRNA, complete cds | 0.020 | 2113798 | (Z83259) AmphiBrf38 [Branchiostoma floridae] | 4.7
821 | AF052163 | Homo sapiens clone 24456 mRNA sequence | 0.020 | 3874733 | (Z6;/54) cDNA EST EMBL:T02354 comes from this gene; cDNA EST EMBL:D32698 comes from this gene; cDNA EST EMBL:D35411 comes from this gene | 4.7






Nearest Neighbor (BlastN vs. Genbank) | Nearest Neighbor (BlastX vs. Non-Redundant Proteins)
SEQ ID | ACCESSION | DESCRIPTION | P VALUE | ACCESSION | DESCRIPTION | P VALUE
822 | L11002 | Rat ankyrin binding glycoprotein-1 related mRNA sequence. | 0.020 | 552132 | (K01664) Bkm-like protein [Drosophila melanogaster] | 3.8
823 | AE001539 | Helicobacter pylori, strain J99 section 100 of 132 of the complete genome | 0.020 | 1172292 | (L11895) transmembrane protein [Saccharomyces cerevisiae] | 3.8
824 | X98890 | S. tuberosum mRNA for inorganic phosphate transporter, StPT1 | 0.020 | 3879798 | U-ui i_u; siinnai m ttk Domain (2 domains); cDNA EST yk390b10.3 comes from this gene; cDNA EST EMBL:D71652 comes from this gene; cDNA EST yk275f8.3 comes from this gene; cDNA EST yk393b9.3 comes from this gene; cDNA EST yk37... >gi|3880220|gnl|PID|e1349842 yk390b10.3 comes from this gene; cDNA EST EMBL:D71652 comes from this gene; cDNA EST yk275f8.3 comes from this gene; cDNA EST yk393b9.3 comes from this gene; cDNA EST yk37... | 1.3
825 | U97519 | Homo sapiens podocalyxin-like protein mRNA, complete cds | 0.020 | 1345633 | C-1-TETRAHYDROFOLATE SYNTHASE, CYTOPLASMIC (C1-THF SYNTHASE) (METHYLENETETRAHYDROFOLATE DEHYDROGENASE / METHENYLTETRAHYDROFOLATE CYCLOHYDROLASE C-1-tetrahydrofolate synthase [Rattus norvegicus] | 0.066
826 | AF003133 | Caenorhabditis elegans cosmid T21E3 | 0.020 | 1709997 | DNA REPAIR PROTEIN RAD18 >gi|1150622 protein rad18 [Schizosaccharomyces pombe] | 2e-07
827 | U32857 | Saccharomyces cerevisiae VAR1 gene, mitochondrial gene encoding mitochondrial protein, 3' processing site, partial sequence | 0.019 | <NONE> | <NONE> | <NONE>






Nearest Neighbor (BlastN vs. Genbank) 



SEQ

ID | ACCESSION | DESCRIPTION



828 | AF027174



Arabidopsis thaliana
cellulose synthase
catalytic subunit (Ath-
B) mRNA, complete
cds



829 | AF034099



Laccaria bicolor
glyoxal malate
synthase protein
mRNA, complete cds



Nearest Neighbor (BlastX vs. Non-Redundant Proteins)



ACCESSION 



0.019 



2506381 



0.019 



3880930 



DESCRIPTION 



NOTCH HOMOLOG
PROTEIN 4 PRECURSOR 
(TRANSFORMING PROTEIN 
INT-3) mammary gene mRNA, 
complete cds.J, gene product 
us musculus 



[Mus musculus]



similar to

Phosphoglucomutase and
phosphomannomutase
phosphoserine; cDNA EST
EMBL:D36168 comes from this
gene; cDNA EST
EMBL:D70697 comes from this
gene; cDNA EST yk373h9.J
comes from this gene; cDNA
EST EMBL:T0Q8...



P VALUE



3.3



Mus musculus
Pontin52 mRNA,



6e-15



830 | AF100694 complete cds



831 



U24578 



Human RP1 and
complement C4B
precursor (C4B)
genes, partial cds



0.018 



<NONE> 



<NONE> 



<NONE>



0.013 



478673 



832 | U76523



Sambucus nigra lectin
precursor mRNA,
complete cds



aropl 

Dibenzofuran-
degrading bacterium
DPO360 2,3-
dihydroxybiphenyl
1,2-dioxygenase
(bphC) gene,
complete cds and 2-
hydroxy-6-oxo-6-
phenylhexa-2,4-
dienoic acid
U57649 hydrolase



0.011 



<NONE> 



proline-rich protein precursor -
kidney bean vulgaris] 



3.1 



<NONE> 



833 



834 



0.011 



Z.mays gene for
phosphoenol pyruvate
X15642 carboxylase



<NONE>



0.011



<NONE>



<NONE> 



<NONE> 



<NONE> 



835 



C.elegans collagen
X51623 gene col-13



0.010 



836 



U83656 



Rattus norvegicus NT
KB gene, promoter
region



1695686 



0.008 



4240195 



(D83706) pyruvate carboxylase 
[Bacillus stearothermophilus]



3.1 



Is 



(AB020660) KIAA0853 protein 
[Homo sapiens]



10.0 






Nearest Neighbor (BlastN vs. Genbank)
SEQ



ID | ACCESSION | DESCRIPTION



Homo sapiens gene
encoding retina-
specific guanylyl
837 | AJ222657 cyclase



838 | AB012106



Brassica rapa mRNA
for SRK45, complete
cds



Nearest Neighbor (BlastX vs. Non-Redundant Proteins)



P VALUE | ACCESSION 




DESCRIPTION 



0.008 



417704 



839 | U76524



Sambucus nigra
ribosome inactivating
protein precursor
mRNA, complete cds



840



Sambucus nigra
ribosome inactivating
protein precursor
AF012899 mRNA, complete cds



0.008 



544024 



0.008 



532468 



841 | AF074385



Sambucus nigra
hevein-like protein
mRNA, complete cds



842 | U48734



Human non-muscle 
alpha-actinin mRNA, 
complete cds 



843 



Homo sapiens 3-
hydroxyisobutyryl-
coenzyme A
hydrolase mRNA,
U66669 complete cds



844 | D16492



Mouse mRNA for
P100 serine protease
of Ra-reactive factor
(RaRF), complete cds



0.008 



0.008 



4101160 



1711520



0.008



2829922



0.007



<NONE> 



0.007 



<NONE> 



(OKHA/tB) [CONTAINS:



RNA-DIRECTED RNA
POLYMERASE; HELICASE;
PROTEASE]



CHLORIDE CHANNEL
PROTEIN, SKELETAL
MUSCLE (CHLORIDE
CHANNEL PROTEIN 1) (CLC-
1) human >gi|397143 (Z25587)
human ClC-1 muscle chloride
channel [Homo sapiens]
>gi|398161 (Z25884) human
ClC-1 muscle chloride channel
[Homo sapiens]



(U13643) similar to reverse 
transcriptase; possible 
pseudogene [Caenorhabditis 
elegans]



(AF002589) cytochrome
oxidase I [Austrofundulus
limnaeus]



5RB-S/9 PROTEIN
>gi|1334996



(AC002291) extensin
[Arabidopsis thaliana] 



7.4 



4.6 



3.8 



2.7 



1.6 



0.11 



<NONE> 



<NONE>



<NONE> 



<NONE>








Nearest Neighbor (BlastN vs. Genbank) | Nearest Neighbor (BlastX vs. Non-Redundant Proteins)
SEQ ID | ACCESSION | DESCRIPTION | P VALUE | ACCESSION | DESCRIPTION | P VALUE
845 | D90923 | Human immunodeficiency virus type 1 proviral DNA for envelope glycoprotein, partial cds, isolate 03S | 0.007 | <NONE> | <NONE> | <NONE>
846 | AB011087 | Homo sapiens mRNA for KIAA0515 protein, partial cds | 0.007 | <NONE> | <NONE> | <NONE>
847 | AE000688 | Aquifex aeolicus section 20 of 109 of the complete genome | 0.007 | <NONE> | <NONE> | <NONE>
848 | X63723 | B.bovis WC1.1 mRNA | 0.007 | <NONE> | <NONE> | <NONE>
849 | AF074386 | Sambucus nigra hevein-like protein mRNA, complete cds | 0.007 | <NONE> | <NONE> | <NONE>
850 | J00097 | Human beta globin region Alu repetitive sequence type T. | 0.007 | <NONE> | <NONE> | <NONE>
851 | D90923 | Human immunodeficiency virus type 1 proviral DNA for envelope glycoprotein, partial cds, isolate 03S | 0.007 | <NONE> | <NONE> | <NONE>
852 | U95094 | Xenopus laevis XL-INCENP (XL-INCENP) mRNA, complete cds | 0.007 | <NONE> | <NONE> | <NONE>
853 | X91618 | T.castaneum hunchback gene | 0.007 | <NONE> | <NONE> | <NONE>
854 | X03838 | Rat nontranscribed spacer (NTS) downstream of 28S rRNA gene | 0.007 | <NONE> | <NONE> | <NONE>
855 | M55049 | Rattus norvegicus interleukin-2 receptor alpha chain (CD25) mRNA, complete cds. | 0.007 | <NONE> | <NONE> | <NONE>






ID | ACCESSION | DESCRIPTION



H.sapiens CpG DNA,
clone 9c2, reverse
856 | Z64318 read cpsftcZnla.



Arabidopsis thaliana
cellulose synthase
catalytic subunit (Ath
- A) mRNA, complete

857 | AF027173 cds



Arabidopsis thaliana
cellulose synthase
catalytic subunit (Ath
- B) mRNA, complete

858 | AF027174 cds

Sambucus nigra
ribosome inactivating
protein precursor

859 | AF012899 mRNA, complete cds



P.falciparum
complete gene map of
860 | X95276 plastid-like DNA



P VALUE I ACCESSION 



0.007 



0.007 



0.007 



0.007 



0.007



Lycopersicon
esculentum class II
small heat shock
protein Le-HSP17.6

861 | U72396 mRNA, complete cds
Mus musculus
Pontin52 mRNA,

862 | AF100694 complete cds



Leucania separata
nuclear polyhedrosis
virus DNAfbrpn.
, |xe. envelope protein,

863 | AB000383 complete cds

Human DNA for

864 | D86566 NOTCH4, partial cds



Sambucus nigra
ribosome inactivating
protein precursor
865 | U76524 mRNA, complete cds



0.007 



0.007 



<NONE> 



<NONE>



<NONE> 



<NONE> 



<NONE> 



0.007



<NONE>



DESCRIPTION | P VALUE



<NONE>



<NONE>



<NONE>



<NONE>



<NONE>






Nearest Neighbor (BlastN vs. Genbank)



SEQ

ID | ACCESSION | DESCRIPTION



866



Arabidopsis thaliana
cellulose synthase

catalytic subunit (Ath-
A) mRNA, complete
AF027173 cds



Nearest Neighbor (BlastX vs. Non-Redundant Proteins)



P VALUE | ACCESSION




0.007 



3047072 



(AF058825) No definition line 
found [Arabidopsis thaliana]



867 | AF027174



Arabidopsis thaliana
cellulose synthase
catalytic subunit (Ath-
B) mRNA, complete



(U29359) SpaO [Salmonella
0.007 | 975754 enterica]



868



Sambucus nigra
ribosome inactivating
protein precursor
U76524 mRNA, complete cds



0.007 



1213557 



(U50199) coded for by C.
elegans cDNA yk89e9.5; coded
for by C. elegans cDNA cm7g5;
coded for by C. elegans cDNA
cm14b9; coded for by C.
elegans cDNA yk52g5.5; coded
for by C. elegans cDNA
yk76e5.5; coded for by C.
elegans cDNA ykt3lfl 1.5; c.



869



Brassica rapa mRNA
for SRK45, complete
AB012106 cds



0.007 



2499568 



ISOASPARTATE(D-
ASPARTATE) O-
METHYLTRANSFERASE
(PROTEIN-BETA-
ASPARTATE

METHYLTRANSFERASE)
(PIMT) (PROTEIN
ISOASPARTYL/D-
ASPARTYL

METHYLTRANSFERASE)
methyltransferase [Drosophila
melanogaster] >gi|1171337
melanogaster]



8.9 



8.6 



8.4



8.3 



870 | AF093268



Rattus norvegicus
homer-1c mRNA,
complete cds



(AF095353) toll-like receptor 4
0.007 | 4092077 mutant [Mus musculus]



6.2 



871 



Sambucus nigra
hevein-like protein
AF074386 mRNA, complete cds



(M80653) tetraheme

0.007 | 151377 [Pseudomonas stutzeri]



6.2



872



Bos taurus (clone
L42319 Sal3.8) tristetraprolin | 0.007



2507337 



TRANSCRIPTION 
TERMINATION FACTOR 
RHO 



5.5 






Nearest Neighbor (BlastN vs. Genbank) | Nearest Neighbor (BlastX vs. Non-Redundant Proteins)
SEQ ID | ACCESSION | DESCRIPTION | P VALUE | ACCESSION | DESCRIPTION | P VALUE
873 | M59815 | Human complement component C4A gene, exons 10 through 41. | 0.007 | 3876769 | (Z6(^oi7) Similarity to Human Prolyl 4-hydroxylase alpha subunit (SW:P4HA_HUMAN); cDNA EST yk2l9gi2J comes from this gene; cDNA EST yk3!9d8_5 comes from this gene; cDNA EST yfc339dl 13 comes from this gene; cDNA EST yk371c9.3... | 5.3
874 | X63723 | B.bovis WC1.1 mRNA | 0.007 | 2969893 | (AJ001858) human SBVI2 [Homo sapiens] | 5.3
875 | AB009864 | Expression vector pME18S-FL3, complete sequence | 0.007 | 2137618 | p45 NF-E2 related factor 2 - mouse musculus] | 5.1
876 | U76524 | Sambucus nigra ribosome inactivating protein precursor mRNA, complete cds | 0.007 | 2804497 | (AF043705) contains similarity to C2H2-type zinc fingers | 5.0
877 | U95102 | Xenopus laevis mitotic phosphoprotein 90 mRNA, complete cds | 0.007 | 440298 | (L27469) product of alternative splicing [Drosophila melanogaster] | 4.7
878 | X58869 | Chicken mRNA for aldehyde dehydrogenase | 0.007 | 1185062 | (L75945) flagellar export protein [Borrelia burgdorferi] | 4.1
879 | AF027735 | Nephila clavipes minor ampullate silk protein MiSp1 mRNA, partial cds | 0.007 | 2394390 | (AF017434) pmMikegene product [Methylobacterium extorquens] | 4.0
880 | AF105228 | Bos taurus tuftelin mRNA, complete cds | 0.007 | 3036802 | (AL022373) putative protein | 3.9
881 | AF100694 | Mus musculus Pontin52 mRNA, complete cds | 0.007 | 2500S14 | i i FUlHhllLAL KIT r PROTEIN T27F2.1 IN CHROMOSOME V >gi|3880311|gnl|PID|e1349855 BX42 (SW:BX42_DROME); cDNA EST EMBL:C07233 comes from this gene; cDNA EST EMBL:C08532 comes from this gene; cDNA EST yk501h10.3 comes from this gene; cDNA EST yk501f1.3... | 3.8






Nearest Neighbor (BlastN vs. Genbank)
SEQ

ID | ACCESSION



882 | X93567



883 | AB012106



884 | AF041056



885 | AF093268



886 | M74798



DESCRIPTION

major mRNA for
beta-tubulin n404bp>



P VALUE 



Brassica rapa mRNA 
for SRK45, complete 
cds 



Homo sapiens 
WSCR4 gene, exons
and 4



0.007



0.007



Rattus norvegicus
homer-1c mRNA,
complete cds



Nearest Neighbor (BlastX vs. Non-Redundant Proteins)



ACCESSION 



2317862 



DESCRIPTION 



P VALUE



3881103



0.007



135817 



887 | Z62997 



888 | AF074386



889 | L29426



Hevea brasiliensis 3-
hydroxy-3-
methylglutaryl-
coenzyme A
reductase gene, 3'
end.



0.007



1723518



modules 4 & 5 [Streptomyces
fradiae]

(AL032646) predicted using
Genefinder; cDNA EST
EMBL:D76407 comes from this
gene; cDNA EST
EMBL:CG8999 comes from this
gene; cDNA EST ykl99bl2.5
comes from this gene; cDNA
EST yk282a4.5 comes from this
gene; cDNA EST EMBL:CO...



THROMBIN RECEPTOR
PRECURSOR human
>gi|339677 (M62424) thrombin
receptor [Homo sapiens]



HYPOTHETICAL 32.2 KD
PROTEIN C22E12.04 IN
CHROMOSOME I >gi|1220279
(Z70043) unknown



H.sapiens CpG DNA,
clone 76g11, reverse
read cpg76g11.rt1a.



Sambucus nigra 
hevein-like protein
mRNA, complete cds



Synechocystis species
(strain PCC 6803)
drgA gene, complete
cds.



0.007



1001282 



0.007 



1176532 



0.007 



2498317 



0.007 



3SS2275 



(D64003) polyA polymerase



PROTEIN C34E10.8 IN
CHROMOSOME III
>gi|500731 (U10402) weakly
similar to protein C kinase



substrate [Caenorhabditis
DVA-1 POLYPROTEIN



PRECURSOR nematode
polyprotein antigen precursor

[Dictyocaulus viviparus]
>gi|1585421|prf||2124414A
polyprotein antigen/allergen

[Dictyocaulus viviparus]



3.0



(AB018320) KIAA0777 protein
[Homo sapiens]



2.7 



2.2



2.1



1.9 



1.8 



1.2 



1.1 






Nearest Neighbor (BlastN vs. Genbank) | Nearest Neighbor (BlastX vs. Non-Redundant Proteins)
SEQ ID | ACCESSION | DESCRIPTION | P VALUE | ACCESSION | DESCRIPTION | P VALUE
890 | D83329 | Mus musculus DNA for prostaglandin D2 synthase, complete cds | 0.007 | 1001741 | (D64004) hypothetical protein | 0.97
891 | AB012106 | Brassica rapa mRNA for SRK45, complete cds | 0.007 | 1723928 | lHvKrrH6tic 4 AL uiko PROTEIN IN NUT1-ARO2 INTERGENIC REGION PRECURSOR YGL149w - yeast (Saccharomyces | 0.94
892 | U76524 | Sambucus nigra ribosome inactivating protein precursor mRNA, complete cds | 0.007 | 121452 | GLUTENIN, HIGH MOLECULAR WEIGHT SUBUNIT 12 PRECURSOR >gi|82606|pir||A24266 glutenin high molecular weight chain 12 precursor - wheat >gi|21779 | 0.79
893 | AF027173 | Arabidopsis thaliana cellulose synthase catalytic subunit (Ath-A) mRNA, complete cds | 0.007 | 927287 | (U30294) ORF2 [Prevotella ruminicola] | 0.35
894 | Y11918 | H.sapiens IMAGE cDNA clone 26881 | 0.007 | 1055188 | (U40061) contains similarity to transmembrane domains like those found in sugar transporter proteins | 0.26
895 | L36827 | Mus Musculus alphaA-crystallin-binding protein I | 0.007 | 4063019 | (AF0S3O61) ABC transporter TliF [Pseudomonas fluorescens] | 0.21
896 | L36827 | Mus Musculus alphaA-crystallin-binding protein I | 0.007 | 4063019 | (AF0S3O61) ABC transporter TliF [Pseudomonas fluorescens] | 0.20
897 | Z65719 | H.sapiens CpG DNA, clone 54c10, reverse read cpg54c10.rt1a. | 0.007 | 1097307 | HIC-1 gene [Homo sapiens] | 0.20
898 | AF064029 | Helianthus tuberosus lectin 1 mRNA, complete cds | 0.007 | 1174915 | UTROPHIN (DYSTROPHIN-RELATED PROTEIN 1) (DRP1) (DRP) >gi|284488|pir||S28381 utrophin protein) [Homo sapiens] | 0.002
899 | AF051730 | Mus musculus cathepsin S (CatS) gene, exon 6 | 0.007 | 1707017 | (U7S721) RNA helicase isolog [Arabidopsis thaliana] | 0.001






Nearest Neighbor (BlastN vs. Genbank) | Nearest Neighbor (BlastX vs. Non-Redundant Proteins)
SEQ ID | ACCESSION | DESCRIPTION | P VALUE | ACCESSION | DESCRIPTION | P VALUE
900 | U62398 | Oryctolagus cuniculus gp42/basigin/OX-47/HT7 mRNA, complete cds. | 0.007 | 2370494 | (Z98944) hypothetical protein | 2e-04
901 | X76341 | M.musculus glutathione reductase mRNA. | 0.007 | 3513303 | (AC005594) R26984.1 [Homo sapiens] | 8e-07
902 | M26215 | Rat (lambda 20B0.5) M-type 6-phosphofructo-2-kinase/fructose-2,6-bisphosphatase | 0.007 | 3036809 | (AL022373) putative protein | 6e-15
903 | AB007902 | Homo sapiens KIAA0442 mRNA, partial cds | 0.007 | 2662165 | (AB007902) &HW12cDNa clone for KIAA0442 has a 574-bp insertion at position 1474 of the sequence of KIAA0442. [Homo sapiens] | 2e-17
904 | U93364 | Lactococcus lactis cremoris plasmid pNZ4000 insertion sequence IS982 putative transposase gene and eps gene cluster (epsRXABCDEFGHIJKL), complete cds | 0.007 | 2731377 | [U^a/jy) similar to alcohol dehydrogenase/ribitol dehydrogenase [Caenorhabditis elegans] | 1e-31
905 | AF093268 | Rattus norvegicus homer-1c mRNA, complete cds | 0.006 | <NONE> | <NONE> | <NONE>
906 | AF100694 | Mus musculus Pontin52 mRNA, complete cds | 0.006 | <NONE> | <NONE> | <NONE>
907 | AF074386 | Sambucus nigra hevein-like protein mRNA, complete cds | 0.006 | <NONE> | <NONE> | <NONE>
908 | AF027174 | Arabidopsis thaliana cellulose synthase catalytic subunit (Ath-B) mRNA, complete cds | 0.006 | <NONE> | <NONE> | <NONE>
909 | AJ005813 | Arabidopsis thaliana mRNA for neoxanthin cleavage enzyme | 0.006 | <NONE> | <NONE> | <NONE>






Nearest Neighbor (BlastN vs. Genbank) | Nearest Neighbor (BlastX vs. Non-Redundant Proteins)



SEQ ID | ACCESSION



910 | AF027174



911 | AF093268



912 | AF093268

913 | AB012106



914 | AF064029



DESCRIPTION



P VALUE | ACCESSION



Arabidopsis thaliana 
cellulose synthase 
catalytic subunit (Ath-
B) mRNA, complete
cds



Rattus norvegicus
homer-1c mRNA,
complete cds



Rattus norvegicus
homer-1c mRNA,

complete cds

Brassica rapa mRNA
for SRK45, complete 
cds 



915 | AF100694



916 | AF093268



917 | AF100694



918 | AF012899



919 | X3Q289



Helianthus tuberosus
lectin 1 mRNA,
complete cds



Mus musculus
Pontin52 mRNA,
complete cds



Rattus norvegicus
homer-1c mRNA,
complete cds 



0.006 



<NONE> 



0.006 



<NONE> 



0.006
0.006



<NONE> 
<NONE> 



0.006 



<NONE> 



0.006 



<NONE> 



Mus musculus 
Pontin52 mRNA, 
complete cds 



Sambucus nigra
ribosome inactivating
protein precursor 
mRNA, complete cds 



DESCRIPTION P VALUE



<NONE> 



<NONE>



<NONE> 



<NONE>



<NONE> 
<NONE>



<NONE> 



<NONE>



<NONE> 



0.006



4049856 



(AF063866) ORF MSV064 
hypothetical protein 
[Melanoplus sanguinipes
entomopoxvirus]



<NONE>



0.006 



3880536 



H.sapiens PTPL1
mRNA for protein 
tyrosine phosphatase 



0.006 



3877761 



0.006



II 68791 



UXIUIU) predicted using
Genefinder; similar to Lectin C-
type domain short and long
forms (2 domains); cDNA EST
EMBL:C10633 comes from this
gene; cDNA EST
EMBL:C12424 comes from this
gene; cDNA EST yki9lej,3
comes from this ...



9.6 



7,9 



(ZS1552) F56G4.1 
[Caenorhabditis elegans]
>gi|38786l5|gnI|PID|el348240 
(ZS3H8) F36G4.1 



7.5 



CATHEPSIN E PRECURSOR 
precursor - rabbit >gi|402729
(LQS41S) procathepsin E 



7.4 








Nearest Neighbor (BlastN vs. Genbank)


Nearest Neighbor (BlastX vs. Non-Redundant Proteins)


SEQ
ID


ACCESSION


DESCRIPTION


P VALUE


ACCESSION


DESCRIPTION


P VALUE


920 


AF074386


Sambucus nigra
hevein-like protein
mRNA, complete cds 


0.006 


1346371


RINA^L BET* — 

DIACYLGLYCEROL 

IIVUlAoC ) 

>gi|477059|pir||A47744
diacylglycerol kinase (EC
2.7.1.107) beta - rat 90kDa-
diacylglycerol kinase [Rattus


53 | 


921 


U72396 


Lycopersicon 
esculentum class II
small heat shock
protein Le-HSP17.6
mRNA, complete cds


0.006 


2196567


(D88588) lipoprotein
[Escherichia coli]


4.3


922 


AF074387 


Sambucus nigra 
hevein-like protein
mRNA, complete cds 


0.006 




( 283259) AmphiBrf38
[Branchiostoma floridae]


4.3


923


AB012106


Brassica rapa mRNA
for SRK45, complete 
cds 


0.006 


1388166 


(U58282) Bowel [Drosophila 
melanogaster]


4.3


924


AF074386 


Sambucus nigra 
hevein-like protein
mRNA, complete cds


0.006




HYPOTHETICAL 20.1 KD 
PROTEIN Y4YS 


4.2


925


AF012899 


Sambucus nigra 
ribosome inactivating 
protein precursor 
mRNA, complete cds 


0.006


416592 


A-AGGLUTININ
ATTACHMENT SUBUNIT
PRECURSOR
>gi|101170|pir||A41258 a-
agglutinin core protein AGA1 -
yeast (Saccharomyces
cerevisiae)


2.7




926


AF064029


Helianthus tuberosus
lectin 1 mRNA,
complete cds


0.006


416592


A-AGGLUTININ
ATTACHMENT SUBUNIT
PRECURSOR
>gi|101170|pir||A41258 a-
agglutinin core protein AGA1 -
yeast (Saccharomyces
cerevisiae)


2.5




927


AJ005813


Arabidopsis thaliana
mRNA for
neoxanthin cleavage
enzyme


0.006


3258584


(U41263) The 3' UTR of this
gene overlaps the 3' UTR of
T19D12.6(confirmed by EST
hits) [Caenorhabditis elegans]


2.0




928


U33949


Human Down
Syndrome region of
chromosome 21
genomic sequence,
clone A12HI-1A6.


0.006


3850997


(AF067150) beta-hydroxyacyl-
ACP dehydratase precursor


19 J 
Nearest Neighbor (BlastN vs. Genbank)



SEQ
ID

ACCESSION



DESCRIPTION



1183 AF090115



Lycopersicon
esculentum cytosolic
class II small heat
shock protein HCT2
(HSP17.4) mRNA,
complete cds



Sambucus nigra
ribosome inactivating
protein precursor
AF012899 mRNA, complete cds



Nearest Neighbor (BlastX vs. Non-Redundant Proteins)



P VALUE ACCESSION



2e-04 



,729008 



Sambucus nigra 
hevein-like protein

1185 AF074386 mRNA, complete cds



2e-04



2507582 



2e-04 



1186 AF027173



Arabidopsis thaliana
cellulose synthase
catalytic subunit (Ath-
A) mRNA, complete
cds



1187 AJ005813



1188 AF027174



Arabidopsis thaliana
mRNA for
neoxanthin cleavage
enzyme

Arabidopsis thaliana
cellulose synthase
catalytic subunit (Ath-
B) mRNA, complete
cds



1085500 



2e-04 



2623967 



2e-04 



2497316 



2e-04 



DESCRIPTION 



PRECURSOR (TYROSINE- 
PROTEIN KINASE CAK) 
(CELL ADHESION KINASE) 
(TYROSINE KINASE DDR) 
(DISCOIDIN RECEPTOR 
TYROSINE KINASE) (TRK E) 
(PROTEIN-TYROSINE 
KINASE RTK 6) sapiens]
HVPOlHtllLAUJS.iKD 



PROTEIN IN MOLR-BGLX 
INTERGENIC REGION 
>gi|I78S436 (AE000300) 
putative regulator [Escherichia 

coli]

collagen alpha 1(IX) chain -
mouse musculus]
>gi|744962|prf||2015346A

collagen:SUBUNIT=alpha1:ISO
TYPE=IX [Mus musculus]



(Y13942) GTN Reductase
[Agrobacterium radiobacter]

GLYCOSYLATION END
PRODUCT-SPECIFIC
RECEPTOR PRECURSOR
(RECEPTOR FOR
ADVANCED
GLYCOSYLATION END
PRODUCTS) products receptor
precursor - bovine >gi|163651

(M91212) receptor for advanced
glycosylation end products [Bos

taurus]



1001710 (D64004) hypothetical protein



8.3 



7.8 



7.8 



7.4



5.3 



3.5 






Nearest Neighbor (BlastN vs. Genbank)


Nearest Neighbor (BlastX vs. Non-Redundant Proteins)


SEQ
ID


ACCESSION


DESCRIPTION


P VALUE


ACCESSION


DESCRIPTION


P VALUE


1189 


AJ005813


Arabidopsis thaliana 
mRNA for 
neoxanthin cleavage 
enzyme 


2e-04 


3258584 


(U41263) The 3' UTR of this
gene overlaps the 3' UTR of
T19D12.6(confirmed by EST
hits) [Caenorhabditis elegans]


2.1 


1190 


AF027173 


Arabidopsis thaliana 
cellulose synthase 
catalytic subunit (Ath-
A) mRNA, complete
cds


2e-04


2736338 


(AF038623) contains similarity 
to RNA recognition motifs


0.39 


1191 


U72396 


Lycopersicon 
esculentum class II
small heat shock
protein Le-HSP17.6
mRNA, complete cds


2e-04 


2196567 


(D88588) lipoprotein 
[Escherichia coli]


0.69 


1192 


AF090115


Lycopersicon 
esculentum cytosolic 
class II small heat 
shock protein HCT2 
(HSP17.4) mRNA, 
complete cds 


2e-04 


3319874 


(AJ006096) F-spondin 
[Branchiostoma floridae]


5e-04 


1193 


L26049 


Chlamydomonas 
reinhardtii dynein
heavy chain alpha
(ODA11) gene, exons
2-15, and partial cds.


2e-04 


3876775 


(Z81077) predicted using 
Genefinder; Similarity to Yeast
protein 8248 (TR:G587531)


2e-09


1194 


AF100694


Mus musculus
Pontin52 mRNA,
complete cds


1e-04


<NONE> 


<NONE> 


<NONE> 


1195 


AF064029 


Helianthus tuberosus
lectin 1 mRNA,
complete cds


1e-04


<NONE> 


<NONE> 


<NONE> 


1196 


L34219


Homo sapiens
retinaldehyde-binding
protein (CRALBP)
gene, complete cds.


1e-04


<NONE> 


<NONE> 


<NONE> 


1197 


X51890


Rhesus monkey
interleukin-3 gene


1e-04


<NONE> 


<NONE> 


<NONE> 



Nearest Neighbor (BlastN vs. Genbank)


Nearest Neighbor (BlastX vs. Non-Redundant Proteins)


SEQ
ID


ACCESSION


DESCRIPTION


P VALUE


ACCESSION


DESCRIPTION


P VALUE


1198


AE001421


Plasmodium
falciparum
chromosome 2,
section 58 of 73 of
the complete
sequence


1e-04


<NONE> 


<NONE> 


<NONE> 


1199 


AF090115 


Lycopersicon 
esculentum cytosolic
class II small heat
shock protein HCT2
(HSP17.4) mRNA,
complete cds


1e-04


<NONE>


<NONE>


<NONE> 


1200 


AF027174 


Arabidopsis thaliana 
cellulose synthase 
catalytic subunit (Ath-
B) mRNA, complete
cds


1e-04


2576287


(Y15086) HcpC protein


4.7 


1201 


AJ005813 


Arabidopsis thaliana 
mRNA for 
neoxanthin cleavage
enzyme


1e-04




(AB016623) RWC-3 [Oryza


0.14 


1202 


AF038035


Homo sapiens
BRCA1-associated
RING domain protein
(BARD1) gene,
exons 2 and 3


9e-05 


<NONE> 


<NONE> 


<NONE> 


1203 


AJ005813 


Arabidopsis thaliana 
mRNA for 
neoxanthin cleavage
enzyme


9e-05 


<NONE> 


<NONE> 


<NONE> 


1204 


AB012106 


Brassica rapa mRNA 
for SRK45, complete
cds


9e-05 


<NONE> 


<NONE> 


<NONE> 


1205 


U95098


Xenopus laevis
mitotic
phosphoprotein 44
mRNA, partial cds


9e-05 


<NONE> 


<NONE> 


<NONE> 


1206 


AF034099


Laccaria bicolor
glyoxal malate
synthase protein
mRNA, complete cds


9e-05


1351553


HYPOTHETICAL
LIPOPROTEIN MG348
PRECURSOR
>gi|1361668|pir||E64238
hypothetical protein MG348 -
Mycoplasma genitalium (SGC3)
>gi|3844931


8.3 
Nearest Neighbor (BlastN vs. Genbank)


Nearest Neighbor (BlastX vs. Non-Redundant Proteins)


SEQ 
ID 


ACCESSION 


DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 
















1207 


D50006


Human DNA for
alpha-platelet-derived
growth factor
receptor, exon 6-10


9e-05 


3063639 


dehydrogenase subunit 5 
[Panorpa japonica] 


51 


1208 




Human Down 
Syndrome region of 
chromosome 21, 

Mr\T\f» Ad IRS IR7 
cionc .-\*+ I DO" ID/. 


9e-05




INHIBIN ALPHA CHAIN
PRECURSOR bovine
>gi|163195 (M13273) inhibin A
subunit [Bos taurus]


3.0 


1209 


AJ005813 


Arabidopsis thaliana
mRNA for
neoxanthin cleavage
enzyme


9e-05 


4007782 


(X72850) 2,4-
dihydroxybenzoate
monooxygenase [Sphingomonas
sp.]


2.3 


1210 


AC005276 


Homo sapiens clone
fragment

UWGC:gap3 from 
7q3 13, complete 

sapiens] 


9e-05 


1492075 


(UoOjIj) MCIj>2L [Molluscum 
contagiosum virus subtype I] 


1.0


1211 




Mus musculus
Pontin52 mRNA, 
complete cds 


9e-05 


2887423 


(AB007884) KIAA0424 [Homo
sapiens]


2e-10 


1212 


a / / / 


Cfuscus gamma-M2- 
i irystaiun nruw*A. 






(U83115) non-lens beta gamma-
crystallin like protein [Homo
sapiens]


7e-25


1213


AB012106 


Brassica rapa mRNA
for SRK45, complete
cds


8e-05 


<NONE> 


<NONE>


<NONE> 


1214 


L06178


Apis mellifera

mitochondrial
genome


8e-05


<NONE> 


<NONE> 


<NONE> 


1215 


AB012106


Brassica rapa mRNA
for SRK45, complete
cds


8e-05


<NONE>


<NONE>


<NONE> 


1216 


U76524 


Sambucus nigra 
ribosome inactivating 
protein precursor 
mRNA. complete cds 


8e-05 


<NONE> 


<NONE>


<NONE> 


1217 


L06178 


Apis mellifera 
ligustica complete 
mitochondrial 
genome


8e-05


<NONE>


<NONE>


<NONE>


1218


AB012106


Brassica rapa mRNA
for SRK45, complete
cds


8e-05


<NONE> 


<NONE> 


<NONE>
Nearest Neighbor (BlastN vs. Genbank)


Nearest Neighbor (BlastX vs. Non-Redundant Proteins)


SEQ
ID


ACCESSION 


DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE


1219


AF100694


Mus musculus
Pontin52 mRNA,
complete cds


8e-05


<NONE> 


<NONE> 


<NONE> 


1220 


AB012106


Brassica rapa mRNA
for SRK45, complete
cds


8e-05


<NONE>


<NONE> 


<NONE> 


1221 


AB012106


Brassica rapa mRNA
for SRK45, complete
cds


8e-05


1722841


WNT-11 PROTEIN
PRECURSOR (XWNT-11)
clawed frog >gi|439108
(L23542) maternal protein 


9.9 


1222 


AF027174


Arabidopsis thaliana
cellulose synthase
catalytic subunit (Ath-
B) mRNA, complete
cds 


8e-05 


1205991 


(U35637) nebulin [Homo 
sapiens]


9.6 


1223 


AF024605 


Homo sapiens serine 
protease-like protease
Sequence 2 from 
patent US 5736377 


8e-05 


3242783 


(AF055354) respiratory burst 
oxidase protein B 


8.6 


1224 


Y13148


Rattus norvegicus
mRNA for r'AuoUo
gene 


8e-05 


2314243 


(AE000616) alpha-ketoglutarate
permease (ketP) 


8.1 


1225 


AJ005813


Arabidopsis thaliana
mRNA for
neoxanthin cleavage
enzyme


8e-05


1170586


RAS GTPASE-ACTIVATING-
LIKE PROTEIN IQGAP1
(P195) (KIAA0051)
>gi|627594|pir||A54854 Ras
GTPase activating-related
protein - human sapiens]
>gi|536344 (L33075) ras
GTPase-activating-like protein
[Homo sapiens]


7.8 


1226 


AF027173 


Arabidopsis thaliana 
cellulose synthase 
catalytic subunit (Ath-
A) mRNA, complete
cds


8e-05


464239


NADH-UBIQUINONE
OXIDOREDUCTASE CHAIN
4 >gi|1035185|pir||S52968
NADH dehydrogenase chain 4 -
honeybee mitochondrion
(SGC4) >gi|552446 (L06178)
NADH dehydrogenase subunit 4
[Apis mellifera ligustica]


3.5 


1227 


AF100694


Mus musculus
Pontin52 mRNA,
complete cds


8e-05


544353


F-SPONDIN PRECURSOR


3.5



Nearest Neighbor (BlastN vs. Genbank)


Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 


SEQ 
ID 


ACCESSION 


DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 
















1228


U76524 


Sambucus nigra 
ribosome inactivating 
protein precursor 
mRNA. complete cds 


8e-05 


483243 


(fragment) 


3.4 


1229 


AF093268 


Rattus norvegicus 
homer-1c mRNA,
complete cds


8e-05


91207 


proline-rich protein - mouse 
(fragment) musculus] 


2.2 


1230 


AF027173 


Arabidopsis thaliana 
cellulose synthase 
catalytic subunit (Ath-
A) mRNA, complete
cds


8e-05


2499181


>gi|1066466


2.2 


1231 


AF027173 


Arabidopsis thaliana 
cellulose synthase 
catalytic subunit (Ath-
A) mRNA, complete
cds 


8e-05 


2499181 


ZONADHESIN PRECURSOR 
>gi|1066466 


19 


1232 


AB012106


Brassica rapa mRNA
for SRK45, complete
cds 


8e-05 


2833647 


(AF027972) flagelliform silk
protein [Nephila clavipes]


1.6 


1233 


AF093268


Rattus norvegicus
homer-1c mRNA,


8e-05 


1163063 


(Z49821) MY02 

[Saccharomyces cerevisiae]


n on 


1234 


AF027174 


Arabidopsis thaliana 
cellulose synthase 
catalytic subunit (Ath-
B) mRNA, complete
cds 


8e-05 


1653488 






1235 


M26510 


Chicken nonmuscle 
myosin heavy chain 
(MHC) gene, 
complete cds. 


8e-05


112159 


plectin - rat 


0.003 


1236 


U56402 


Human chromatin 
structural protein 
homolog


8e-05 


2088823 


(AF003384) weak similarity to
the peptidase family A2


1e-13


1237 


AF100694


Mus musculus
Pontin52 mRNA, 
complete cds 


8e-05 


437181 


(U02289) GTPase-activating 
protein [Caenorhabditis elegans]


2e-17 


1238 


AF100694


Mus musculus
Pontin52 mRNA,
complete cds 


8e-05 


465983 


HYPOTHETICAL 80 S KD
PROTEIN ZC21.4 IN 
CHROMOSOME III 


8e-27 



Nearest Neighbor (BlastN vs. Genbank)


Nearest Neighbor (BlastX vs. Non-Redundant Proteins)


SEQ
ID


ACCESSION


DESCRIPTION


P VALUE


ACCESSION


DESCRIPTION


P VALUE


1239 


AF090115


Lycopersicon
esculentum cytosolic
class II small heat
shock protein HCT2
(HSP17.4) mRNA,
complete cds






<NONE> 


<NONE> 


1240 


U83656 


Rattus norvegicus NF 
KB gene Y promotor 
region 


7e-05 


3880858 


(AL031633) predicted using 
Genefinder; cDNA EST 
yk304fI2.5 comes from this 
gene [Caenorhabditis elesansl 


9.3 


1241 


AF074387


Sambucus nigra
hevein-like protein
mRNA, complete cds


7e-05


3080538 


(AL022600) hypothetical
protein 


9.2 


1242 


X89398


H.sapiens ung gene
for uracil DNA
glycosylase


7e-05


549700


HYPOTHETICAL 23./ KD

PROTEIN IN MDH1-VMA5
INTERGENIC REGION
>gip^91S2|pir||Sj7908
hypothetical protein YKL083w -
yeast (Saccharomyces
cerevisiae) >gi|486120
(Z280S2) ORF YKL083w


1.8 


1243 


M83753 


Bovine follicle 
stimulating hormone- 
beta subunit gene,
complete cds. 


7e-05 


2398621 


(AJ000342) DMBT1 protein,
5.8 kb transcript [Homo sapiens]


1.8 


1244 


M80829 


Rat troponin T 
cardiac isoform gene, 
complete cds 


5e-05


854065


(X83413) U88 [Human
herpesvirus 6]


2e-08 


1245 


AF074387 


Sambucus nigra 
hevein-like protein 
mRNA, complete cds


4e-05


120240


FLAGELLIN B2 PRECURSOR

Methanococcus voltae

>gi|150063 (M72148) flagellin


5.2


1246


AF012899


Sambucus nigra

ribosome inactivating
protein precursor
mRNA, complete cds


3e-05 


<NONE> 


<NONE> 


<NONE> 


1247 


U76524 


Sambucus nigra 
ribosome inactivating 
protein precursor 
mRNA, complete cds


3e-05 


<NONE> 


<NONE> 


<NONE> 


1248 


AF074386


Sambucus nigra
hevein-like protein
mRNA, complete cds


3e-05 


<NONE> 


<NONE> 


<NONE> 








Nearest Neighbor (BlastN vs. Genbank)


Nearest Neighbor (BlastX vs. Non-Redundant Proteins)


SEQ 
ID 


ACCESSION 


DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 




1249


AF093268


Rattus norvegicus
homer-1c mRNA,
complete cds


3e-05


<NONE>


<NONE>


<NONE>


1250 


AB012106 


Brassica rapa mRNA 
for SRK45, complete 
cds 


3e-05 


2773226 


(AF039716) Similar to protein
kinase [Caenorhabditis elegans] 


6.7 


1251 


AF100694


Mus musculus
Pontin52 mRNA,
complete cds


3e-05


2072961


(U93568) putative p150 [Homo
sapiens] 


5.6 


1252 


U72396 


Lycopersicon 
esculentum class II
small heat shock
protein Le-HSP17.6
mRNA, complete cds


3e-05


121855


EXOGLUCANASE II
PRECURSOR cellulose 1,4-beta-
cellobiosidase (EC 3.2.1.91) II
precursor - fungus (Trichoderma
reesei) 1,4-beta-cellobiosidase
(EC 3.2.1.91) II - fungus
cellobiohydrolase II
[Trichoderma reesei]


4.6


1253 


U76524 


Sambucus nigra 
ribosome inactivating 
protein precursor 
mRNA. complete cds 


3e-05


3880516


(AL021572) similar to CTP
SYNTHASE (EC 6.3.4.2) (UTP-
-AMMONIA LIGASE) (CTP
SYNTHETASE) 


3.3 


1254 


M88299 


Mouse brain- 1 POU- 
domain protein, 
complete cds. 


3e-05 


1947048 


(U66102) intimin [Escherichia
coli]


3.0 


1255 


U95098


Xenopus laevis
mitotic

phosphoprotein 44
mRNA, partial cds


3e-05


3122872


CELL-CYCLE nuclear

AUTOANTIGEN SG2NA
(S/G2 NUCLEAR ANTIGEN)
>gi|1082650|pir||JC2522 nuclear
autoantigen - human >gi|S05095
(U17989) GS2NA


2.8


1256 


U76524 


Sambucus nigra 
ribosome inactivating 
protein precursor 
mRNA, complete cds 


3e-05 


1352145 


CYTOCHROME C OXIDASE 

POLYPEPTIDE I chain I -
Thermus aquaticus >gi|155083
(M84341) cytochrome c oxidase
subunits precursor [Thermus
thermophilus]


2.6


1257


U72396


Lycopersicon
esculentum class II
small heat shock
protein Le-HSP17.6
mRNA, complete cds


3e-05


2811015


SEGMENTATION POLARITY
PROTEIN ENGRAILED
>gi|2076747 (U42429)
engrailed [Anopheles gambiae]
>gi|2l4S9lS (U42214)
engrailed [Anopheles gambiae]


2.0 
Nearest Neighbor (BlastN vs. Genbank)


Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 


SEQ 
ID 


ACCESSION 


DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 
















1258 


AF027174 


Arabidopsis thaliana 
cellulose synthase 
catalytic subunit (Ath-
B) mRNA, complete
cds


3e-05


1657752


(U62325) FE65-like protein
[Homo sapiens]


1.7


1259 


AF100694 


Mus musculus 
Pontin52 mRNA, 
complete cds 


3e-05 


2072961 


(U93568) putative p150 [Homo
sapiens]


1.5


1260 


U76523 


Sambucus nigra lectin 
precursor mRNA,
complete cds 


3e-05 


1352145 


CYTOCHROME C OXIDASE 
POLYPEPTIDE I chain I - 
Thermus aquaticus >gi|155083
(M84341) cytochrome c oxidase
subunits precursor [Thermus
thermophilus]


n 


1261 


X91890 


H.sapiens regulatory 
region of HOXA7
gene 


3e-05 


UI013 


Sxr (Bkm- homology sex- 
determining region protein - 
mouse 


1.0 


1262 


L36936 


Homo sapiens mctase 
gene, partial cds. 


3e-05 


j944352 


(D84239) IgG Fc binding 
protein [Homo sapiens]


0.99


1263


AB012105


Brassica rapa mRNA
for SLG45, complete
cds


3e-05


417782


SMP2 PROTEIN
>gi|320853|pir||S3091i SMP2
protein - yeast (Saccharomyces
cerevisiae) gene
[Saccharomyces cerevisiae]


0.89


1264 


U76524 


Sambucus nigra 
ribosome inactivating 
protein precursor 
mRNA, complete cds 


3e-05 


1708501 


INTEGRIN ALPHA CHAIN-
LIKE PROTEIN alpha Int1p
[Candida albicans] 


0.39 


1265 


AF090115


Lycopersicon
esculentum cytosolic
class II small heat
shock protein HCT2
(HSP17.4) mRNA,
complete cds


3e-05


1587031


cis-Golgi matrix protein GM130
[Rattus norvegicus]


0.20 


1266 


Z81014


Human DNA
sequence from
cosmid U65A4,
between markers
DXS366 and DXS87
on chromosome X


3e-05


2072964


(U93569) putative p150 [Homo
sapiens]


0.049 
Nearest Neighbor (BlastN vs. Genbank)


Nearest Neighbor (BlastX vs. Non-Redundant Proteins)


SEQ
ID


ACCESSION


DESCRIPTION


P VALUE


ACCESSION


DESCRIPTION


P VALUE












1267


Z96668


H.sapiens telomeric
DNA sequence, clone
7PTEL001, read
7PTEL00001.seq


3e-05


542429


glycosylated and myristilated
smaller surface antigen -
Plasmodium falciparum
>gi|S36640 (X76298)
glycosylated and myristilated
smaller surface antigen gallus]
>gi|1092178|prf||2023165B
surface antigen


0.029


1268


AB012105


Brassica rapa mRNA 
for SLG45, complete 
cds 


3e-05 


3879121 


(Z70310) predicted using
Genefinder; Similarity to Mouse
ankyrin (PIR Acc. No. S37771);
cDNA EST EMBL:T01923
comes from this gene; cDNA
EST EMBL:D32335 comes
from this gene; cDNA EST
EMBL:D32723 comes from this
gene; cDNA ES... Genefinder;
Similarity to Mouse ankyrin
(PIR Acc. No. S37771); cDNA
EST EMBL:T01923 comes
from this gene; cDNA EST
EMBL:D32335 comes from this
gene; cDNA EST
EMBL:D32723 comes from this
gene; cDNA ES...


2e-13 


1269 


AF074385 


Sambucus nigra 
hevein-like protein
mRNA, complete cds


3e-05


2497677


ZYXIN (ZYXIN 2) sapiens]
>gi|1545954|gnl|PID|e223417
(X95735) zyxin


2e-23 


1270 


AF027174


Arabidopsis thaliana
cellulose synthase
catalytic subunit (Ath-
B) mRNA, complete
cds


1e-05


<NONE> 


<NONE> 


<NONE> 


1271 


X16318


Canine mRNA for
signal recognition
particle 54k protein


1e-05


3122612


PITUITARY HOMEOBOX 3
(HOMEOBOX PROTEIN
PITX3) >gi|2645427
(AF005772) homeobox protein
Pitx3 [Mus musculus]


4.4


1272


AB012105


Brassica rapa mRNA
for SLG45, complete
cds


1e-05


165245S


(D90905) DNA mismatch repair
protein MutL [Synechocystis
sp.]


0.62 
Nearest Neighbor (BlastN vs. Genbank)


Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 


SEQ 
ID 


ACCESSION 


DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 
















1273 


U57843 


Human 

phosphatidylinositol 
3-kinase delta 
catalytic subunit 
mRNA. complete cds 


le-05 


475909 


(X67098) ORF1A [Homo
sapiens] 


0.22 


1274 


Z96569 


H.sapiens telomeric
DNA sequence, clone
2QTEL054, read
2QTEL00054.seq


1e-05


2137043 


unknown protein - rabbit 
(fragment) cuniculus] 


0.005 


1275 


AE000310 


Methanobacterium
thermoautotrophicum
from bases 172512 to
182957 (section 16 of
148) of the complete
genome


1e-05


3877579


kinensin-like protein KIF4
(SW:P33174); cDNA EST
EMBL:D27320 comes from this
gene; cDNA EST
EMBL:D27322 comes from this
gene; cDNA EST
EMBL:D27321 comes from this
gene; cDNA EST
EMBL:D35764 comes... Mouse
kinensin-like protein KIF4
(SW:P33174); cDNA EST
EMBL:D27320 comes from this
gene; cDNA EST
EMBL:D27322 comes from this
gene; cDNA EST
EMBL:D27321 comes from this
gene; cDNA EST
EMBL:D35764 comes...


6e-27 


1276 


AB012113


Homo sapiens gene
for CC chemokine
PARC precursor,
complete cds


9e-06


<NONE>


<NONE>


<NONE>


1277


AC005830


Homo sapiens Xp11-
154-155 BAC GSHB-
52411 (Genome
Systems Human BAC 
Library), complete 
sequence [Homo 
sapiens] 


9e-06 


<NONE> 


<NONE> 


<NONE> 


1278 


D86245


Human MHC (HLA) 
DRB intron 1 DNA, 
partial sequence 


9e-06 


1051253 


(U37531) mucin apoprotein 
[Mus musculus]


1.3 


1279 


D79998 


Human mRNA for 
KIAA0176 gene,
partial cds


9e-06


2833253


HYPOTHETICAL PROTEIN
KIAA0176 sapiens]


4e-06



Nearest Neighbor (BlastN vs. Genbank)


Nearest Neighbor (BlastX vs. Non-Redundant Proteins)


SEQ
ID


ACCESSION


DESCRIPTION


P VALUE


ACCESSION


DESCRIPTION


P VALUE










1280


U10246


Toxoplasma gondii
RH uracil
phosphoribosyl
transferase gene,
complete cds.


9e-06


3876090


(z,oyo^S> Similarity to Yeast
uridine kinase
(SW:URK1_YEAST); cDNA
EST EMBL:Z14695 comes
from this gene; cDNA EST
CEMSE17F comes from this
gene; cDNA EST
EMBL:D67355 comes from this
gene; cDNA EST yk209h1.5
comes from this ge...


7e-33 


1281 


U10246


Toxoplasma gondii 
RH uracil 
phosphoribosyl 
transferase gene, 
complete cds. 


9e-06 


3876090 


(IWblh) Similarity to Yeast
uridine kinase

(SW:URK1_YEAST); cDNA
EST EMBL:Z14695 comes
from this gene; cDNA EST
CEMSE17F comes from this
gene; cDNA EST
EMBL:D67355 comes from this
gene; cDNA EST yk209h1.5
comes from this ge...


7e-34 


1282


AF012899


Sambucus nigra
ribosome inactivating
protein precursor
mRNA, complete cds


8e-06 


<NONE>


<NONE>


<NONE>


1283


AF012899


Sambucus nigra 
ribosome inactivating 
protein precursor 
mRNA, complete cds 


8e-06 


<NONE> 


<NONE> 


<NONE> 


1284 


U66340 


Human Rh blood 
group C antigen 
(RHCE) gene, exon 
2. partial cds 


8e-06


1707155


(U80837) F07E5.6 gene product
[Caenorhabditis elegans]


9.6 


1285 


AF012899 


Sambucus nigra 
ribosome inactivating 
protein precursor 
mRNA, complete cds 


7e-06 


<NONE> 


<NONE> 


<NONE> 


1286 


M29930 


Human insulin 
receptor (allele 2) 
gene, exons 14, 15, 
16 and 17, 


4e-06 


<NONE> 


<NONE> 


<NONE> 


1287


L42103


Homo sapiens
(subclone 5_d3 from
P1 H25) DNA
sequence.


3e-06


<NONE>


<NONE>


<NONE> 



Nearest Neighbor (BlastN vs. Genbank)


Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 


SEQ 
ID 


ACCESSION 


DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 






1288


AF012244


Mus musculus
cerberus-like (Cer-1)
gene, complete cds


3e-06 


<NONE> 


<NONE> 


<NONE> 


1289 


Z69366


Human DNA
sequence from
cosmid L96F8,
Huntington's Disease
Region, chromosome
4p16.3 contains EST.


3e-06 


<NONE> 


<NONE> 


<NONE> 


1290 


Z69366 


Human DNA 
sequence from 
cosmid L96F8, 
Huntington's Disease 
Region, chromosome 
4p16.3 contains EST


3e-06 


<NONE> 


<NONE> 


<NONE> 


1291 


X85232 


H.sapiens
chromosome 3
sequences


3e-06


<NONE> 


<NONE> 


<NONE> 


1292 


M32674 


Human platelet 
glycoprotein IIIa,
exons 7, 8 and 9.


3e-06 


<NONE> 


<NONE> 


<NONE> 


1293 


D16S79 


Human HepG2 3'
region cDNA, clone
hmd2aGt 


3e-06 


998296 


(U33484) ependymin
[Hemiodus sp.]


5.6 


1294 


U18614


Lagothrix lagotricha
interphotoreceptor
retinoid-binding
protein (IRBP) gene,
intron 1, complete
sequence 


3e-06 


1613846 


(U71440) polyprotein [Rice 
tungro spherical virus] 


5.0 


1295 


AF090115


Lycopersicon
esculentum cytosolic
class II small heat
shock protein HCT2
(HSP17.4) mRNA,
complete cds


3e-06


1477646


(U53204) plectin [Homo
sapiens] >gi|1477651 (U63610)
plectin [Homo sapiens] 


4.0 


1296 


AF016898


Homo sapiens B-ATF 
gene, complete cds 


3e-06 


1085177 


reverse transcriptase - fruit fly 
reverse transcriptase 
[Drosophila yakuba]


3.0 


1297 


AB018490


Homo sapiens DNA,
trinucleotide repeats
region


3e-06 


3876572 


(Z81522) predicted using
Genefinder; similar to RNA
recognition motif, (aka RRM,
RBD, or RNP domain)
[Caenorhabditis elegans]


3.0 
Nearest Neighbor (BlastN vs. Genbank)


Nearest Neighbor (BlastX vs. Non-Redundant Proteins)


SEQ 
ID 


ACCESSION 


DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 
















1298 


AF027174 


Arabidopsis thaliana 
cellulose synthase 
catalytic subunit (Ath- 
B) mRNA, complete 
cds 


3e-06 


4240137 


(AB020631) KIAA0824 protein 
[Homo sapiens]


2.7 


1299 


M37929 


Homo sapiens 
adenosine 
monophosphate 
deaminase 1 
(AMPD1) gene,
exons 11-12,


3e-06


1653775


(D90916) thiol:disulfide
interchange protein DsbD
[Synechocystis sp.]


1.7


1300 


M37929 


Homo sapiens 
adenosine 
monophosphate 
deaminase 1 
(AMPD1) gene,
exons 11-12,


3e-06


1653775


(D90916) thiol:disulfide
interchange protein DsbD
[Synechocystis sp.]


1.7 


1301 


U60496 


Glycine max actin 
(SoyS6) gene, partial 
cds 


3e-06


1730738


ACTIN-LIKE PROTEIN ARP5
Ynl2430p [Saccharomyces
cerevisiae]


2e-05 


1302 


X14363 


Yersinia 

pseudotuberculosis 
rplC, rplD, rplW,
rplB and rpsS genes
for ribosomal proteins
L3, L4, L23, L2 and
S19 


3e-06 


585879 


50S RIBOSOMAL PROTEIN 
L2 maritima >gi|437926 
(Z21677) ribosomal protein L2 


2e-12 


1303 


Z34969


H.sapiens DNA for

microsatellite

polymorphism


2e-06


<NONE>


<NONE>


<NONE> 


1304 


X64707 


H.sapiens BBC1
mRNA


1e-06


<NONE> 


<NONE> 


<NONE> 


1305 


AC005830 


Homo sapiens Xp11-
154-155 BAC GSHB-
52411 (Genome
Systems Human BAC
Library), complete
sequence [Homo
sapiens]


1e-06


<NONE>


<NONE> 


<NONE> 


1306 


J04058 


Human electron 
transfer flavoprotein 
alpha-subunit mRNA,
complete cds.


1e-06


<NONE> 


<NONE> 


<NONE> 
Nearest Neighbor (BlastN vs. Genbank)


Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 


SEQ 
ID 


ACCESSION 


DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 
















1307 


L25647 


Homo sapiens 
fibroblast growth 
factor receptor gene 
(located in the central 
MHC) signal peptide 
and consecutive exon 


1e-06


1586734


mxcQ gene [Methylobacterium
organophilum]




1308 


L26261 


Human MHC class III 
HLA-RP1 gene. 


1e-06


1684985 


(U20633) NADH 
dehydrogenase subunit 
[Neuwiedia veratrifolia] 


1.8 


1309 


fl 1 \J\J ^UJ 


Mus musculus alpha-
actinin-2 associated
LIM protein mRNA,
alternatively spliced 




7QOA1QA 


(AF053367) carboxyl terminal 
LIM domain protein [Mus 
musculus]


4e- 1 f 


1310 


M10935


Human haptoglobin
gene (alpha-2 allele),
complete cds and
haptoglobin-related
gene, exon 1 and


6e-07


<NONE>






1311 


AC002251


Homo sapiens
(subclone 1_g6 from
BAC H76) DNA 
sequence 


4e-07 


2144491 


coagulation factor Xa (EC 
3.4.21.6) precursor norvegicus] 


4.2 


1312 


AF047717 


Streptomyces 
chrysomallus 
actinomycin 
synthetase II (acmB) 
gene, complete cds


4e-07


699196


(U15181) 4-coumarate-coA
ligase [Mycobacterium leprae]


1e-06


1313 


U14417


Human Ral guanine 
nucleotide 

dissociation 
stimulator mRNA, 
partial cds. 


4e-07 


544402 


GUANINE NUCLEOTIDE
DISSOCIATION 
STIMULATOR RALGDS 
FORM A (RALGEF) 
>gi|32t257|pir|lS2S415 guanine 
nucleotide dissociation 
stimulator ralGDS - mouse 
>gi|193573 (L07924) guanine
nucleotide dissociation 
stimulator [Mus musculus] 


Se-OS 


1314 


Z79027 


H.sapiens flow-sorted
chromosome 6
HindIII fragment,
SC6pA20G8 


3e-07 


<NONE> 


<NONE> 


<NONE> 
Nearest Neighbor (BlastN vs. Genbank)


Nearest Neighbor (BlastX vs. Non-Redundant Proteins)


SEQ
ID


ACCESSION


DESCRIPTION


P VALUE


ACCESSION


DESCRIPTION


P VALUE


1315


U67167


Homo sapiens
intestinal mucin
(MUC2) gene,
promoter region and
partial cds


3e-07 


<NONE> 


<NONE> 


<NONE> 


1316 


AF086256 


Homo sapiens full 
length insert cDNA 
clone ZD41C11 


3e-07 


<NONE> 


<NONE> 


<NONE> 


1317 


U67228 


Human clone HS4.61 
Alu-Ya5 sequence 


3e-07


1938437


(U97003) contains similarity to
C4-type zinc fingers and a
ligand-binding domain of
nuclear hormone receptors 


2.3 


1318 


U94346 


Human calpain-like
protease (htra-3)
mRNA, complete cds


3e-07 


2911858


(AF047659) No definition line
found [Caenorhabditis elegans]


0.39 


1319 


Y15724


Homo sapiens
SERCA3 gene, exons
1-7 (and joined CDS)


1e-07


<NONE> 


<NONE> 


<NONE> 


1320 


X13596


Bean DNA for
glycine-rich cell wall
protein GRP 1-8


1e-07


<NONE> 


<NONE> 


<NONE> 


1321


M83094


Homo sapiens
cytosolic selenium-
dependent glutathione
peroxidase gene,
complete cds, and
rhoh12 gene, 3' end.


1e-07


1326385 


(U58751) C07G1.7 gene
product [Caenorhabditis 
elegans] 


8.0 


1322 


Z55905


H.sapiens CpG DNA,
clone 71f4, forward
read cpg71f4.ft1a.


1e-07


1076802 


extensin-like protein - maize
>gi|6001 18 mays]


0.61


1323 


X03541 


Human mRNA of trk 
oncogene > :: 
gb|I96186|I96186
Sequence 23 from
patent US 5734039


1e-07


325465


(M74509) (Human endogenous
retrovirus type C oncovirus 
sequence J } gene product [Homo 
sapiens] 


3e-04 


1324 


AF027766 


Canis familiaris Y- 
linked zinc finger 
protein


1e-07


220643


(D10628) zinc finger protein
[Mus musculus]


7e-08 


1325 



D13613


Bovine mRNA for
rabphilin-3A,
complete cds > ::
dbj|E07509|E07509
cDNA encoding
rabphilin-3A


1e-07


2822161


(AC004082) rab3 effector-like;
>5% Similarity to AF007836
(PID:g2317778) [Homo
sapiens]


6e-11
Nearest Neighbor (BlastN vs. Genbank)


Nearest Neighbor (BlastX vs. Non-Redundant Proteins)


SEQ
ID


ACCESSION 


DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 






1326


X57110


Human mRNA for c-
cbl proto-oncogene


1e-07


323270


(J04169) gag-onc fusion protein
[Cas NS1 retrovirus]


3e-14 


1327 


X57110


Human mRNA for c-
cbl proto-oncogene


1e-07


115855


PROTO-ONCOGENE C-CBL
human >gi|29731 (X57110) c-
cbl protein [Homo sapiens]


4e-19


1328 


AC00U78 


Homo sapiens 
(subclone 2_g12 from
BAC H94) DNA
sequence 


4e-08 


<NONE> 


<NONE> 


<NONE> 


1329 


U11866


Human interleukin-8
receptor type B
(IL8RB) gene,
promoter and exons I- 
6 


4e-08 


<NONE> 


<NONE> 


<NONE> 


1330 


AC001225 


Homo sapiens 
(subclone 2_c6 from 
BAC H94) DNA 
sequence 


4e-08 


478184 


histone HI IM (clone L95) - 
midge 


6.5 


1331 


M73837 


Human modulator 
recognition factor 2 
(MRF-2) mRNA, 
complete cds. 


4e-08 


141448 


HYPOTHETICAL 32.6 KD 
PROTEIN IN TRANSPOSON 
TN4556 >gi|80758|pir||JQ0428
hypothetical 32.6K protein - 
Streptomyces fradiae transposon 
Tn4556 


4.7 


1332 


AC006164 


Homo sapiens clone 
UWGC:y28gap from 
6p2 1 1 complete 
sequence [Homo
sapiens] 


4e-08 


2580578 


(AF000996) ubiquitous TPR 
motif, Y isoform [Homo 
sapiens] 


1.2 


1333 


X01060


Human mRNA for 
transferrin receptor 


4e-08 


135514 


T-CELL RECEPTOR BETA 
precursor (ANA 1 1) - rabbit 


0.61


1334 


Y10697


H.sapiens INE2
mRNA


4e-08


124909


INSULIN RECEPTOR-
RELATED PROTEIN
PRECURSOR (IRR) (IR-
RELATED RECEPTOR)
>gi|186555 sapiens]


0.14


1335 


U60416 


Rattus norvegicus
myr 6 myosin heavy
chain mRNA,
complete cds


4e-08


102189 


myosin L high molecular weight 
- Acanthamoeba sp 


3e-03
Nearest Neighbor (BlastN vs. Genbank)


Nearest Neighbor (BlastX vs. Non-Redundant Proteins)


ID 


ACCESSION 


DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 












HYPOTHETICAL ^2KD" 




1336 


U23804 


Drosophila 
melanogaster putative 
GTP-binding
regulatory protein
beta chain (GPB)
mRNA, partial cds.


4e-08 


2494916 


TRP-ASP REPEATS 
CONTAINING PROTEIN 
T10F2.4 IN CHROMOSOME 
III protein; similar to G-Beta 
repeat region (Trp-Asp 
domains) of guanine nucleotide 
binding protein 


1e-28


1337 


AE000213


Escherichia coli K-12
MG1655 section 103
of 400 of the
complete genome


4e-08


3294172


(ALba3i5)f'RW3.i.i 

(protein similar to C. elegans 
protein 00035,16) (isoform 1) 
[Homo sapiens] 


2e-67 


1338 


D89821


Mus musculus mRNA 
for RhoM, complete 
cds 


2e-08 


3024539 


RHO-RELATED GTP-
BINDING PROTEIN RHOD
(RHO-RELATED PROTEIN
HP1) (RHOHP1) sapiens]


1e-04


1339 


U74382 


Human telomeric 
repeat DNA-binding 
protein (PIN2)
mRNA, complete cds


1e-08


<NONE> 


<NONE> 


<NONE> 


1340 


L35657 


Homo sapiens 
(subclone H8 5_a10
from P1 35 H5 C8)
DNA sequence.


1e-08


<NONE> 


<NONE> 


<NONE> 


1341 


L21936 


Human succinate 
dehydrogenase 
flavoprotein subunit 


1e-08


3201678


(AF060886) adenine
phosphoribosyltransferase
[Leishmania tarentolae]


4.0 


1342 


AB009777


Homo sapiens gene
for osteonidogen,
promoter region


1e-08


479388 


tritin - wheat
>gi|391929|gnl|PID|d1003454


2.2 




1343


M58600


Human heparin
cofactor II (HCF2)
gene, exons 1 through
5.


1e-08


1730173


GLUCOSE-6-PHOSPHATE
ISOMERASE, CYTOSOLIC 2
(GPI) (PHOSPHOGLUCOSE
ISOMERASE) (PGI) isomerase
[Clarkia concinna]


1.9


1344 


M58600


Human heparin
cofactor II (HCF2)
gene, exons 1 through
5.


1e-08


1730173


GLUCOSE-6-PHOSPHATE
ISOMERASE, CYTOSOLIC 2
(GPI) (PHOSPHOGLUCOSE
ISOMERASE) (PGI) isomerase
[Clarkia concinna]


1.7 


1345 


AC000980 


Homo sapiens 
(subclone 1_g2 from
P1 H31) DNA
sequence


1e-08


439877


(L27428) reverse transcriptase
[Homo sapiens]


1.1



WO 01/02568 



PCT/US00/18374 





Nearest Neighbor (BlastN vs. Genbank)


Nearest Neighbor (BlastX vs. Non-Redundant Proteins)


SEQ
ID


ACCESSION


DESCRIPTION


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 
















1346 


U48734 


Human non-muscle
alpha-actinin mRNA,
complete cds


1e-08


168237 


(M76546) hydroxyproline-rich 
protein [Helianthus annuus] 


0.19 


1347 


M76724 


Human leukocyte 
adhesion receptor 
alpha subunit 


1e-08


1177607


(X92485) pva1 [Plasmodium
vivax] 


0.19 


1348 


AF067959 


Gallus gallus
homeodomain protein
HOXD-3 mRNA,
complete cds


1e-08


3165574


(AF067942) No definition line
found [Caenorhabditis elegans]


0.15 


1349 


Z81014 


Human DNA 
sequence from 
cosmid U65A4,
between markers
DXS366 and DXS87
on chromosome X


1e-08


2072964


(U93569) putative p150 [Homo
sapiens]


0.001 


1350 


X57103 


Human h-lys gene for
lysozyme (upstream
region)


7e-09


<NONE> 


<NONE> 


<NONE> 


1351 


AF012899


Sambucus nigra
ribosome inactivating
protein precursor
mRNA, complete cds


7e-09 


231629 


BILE-SALT-ACTIVATED 
LIPASE PRECURSOR ESTER 
LIPASE) (STEROL 
ESTERASE) (CHOLESTEROL 
ESTERASE) salt-activated 
lipase [Homo sapiens] sapiens]




1352 


L34741 


Aplysia californica
prohormone
convertase (PC2)
mRNA, complete cds






cytochrome-c oxidase (EC 
1.9.3.1) chain II precursor -
Synechocystis sp. (PCC 6803) 

ilea i nn i 

>gi|581739 sp ] 


5.0 


1353 


AF052959


Homo sapiens type
XV collagen
(COL15A1) gene,
exon 6


4e-09 


131269


PHOTOSYSTEM II P680
CHLOROPHYLL A
APOPROTEIN (CP-47
PROTEIN)
>gi|7270S|pir|(QJLV6A
photosystem II chlorophyll a-
binding protein psbB - liverwort
(Marchantia polymorpha)
chloroplast >gi|l L700


1.8 
Nearest Neighbor (BlastN vs. Genbank)


Nearest Neighbor (BlastX vs. Non-Redundant Proteins)


SEQ
ID


ACCESSION


DESCRIPTION


P VALUE


ACCESSION


DESCRIPTION


P VALUE


1354 


L15470


Streptomyces
clavuligerus (NRRL
3585) clavulanic acid
biosynthesis protein
(cla) gene, complete
cds and clavaminate
synthase 2 (cs2) gene,
partial cds.


4e-09


586028


UAQMATIWE • ■ 

jurvcvri, I UKULAobj (AUH) 

(PROCLAVAMINIC ACID 
AMIDINO HYDROLASE) 
>gi|1361423|pir||S57669
proclavaminic acid amidino
hydrolase - Streptomyces
clavuligerus >gi|295171
proclavaminic acid amidino
hydrolase [Streptomyces
clavuligerus]
>gi|1586122|prf||2203286B
proclavaminic acid amidino
hydrolase [Streptomyces
clavuligerus]


Am 1 1 1 ' 


1355 


AB002302 


Human mRNA for 
KIAA0304 gene, 
complete cds 


2e-09 


131600


GENERAL SECRETION
PATHWAY PROTEIN L
product [Klebsiella pneumoniae]
>gi|149311 (M32613) pulL


2.5 


1356 


L34219 


Homo sapiens 
retinaldehyde-binding 
protein (CRALBP) 
gene, complete cds. 


1e-09


<NONE>


<NONE>


<NONE>


1357 


AB002302 


Human mRNA for 
KIAA0304 gene, 
complete cds 


1e-09


2224549


(AB002302) KIAA0304 [Homo
sapiens]


5.0


1358


l 


Homo sapiens 
HSPA1L gene for 
heat shock protein 70
testis variant, 5'UTR,
partial sequence


1e-09


1389766


(U58658) unknown [Homo
sapiens]


1.3 


1359


AF064483


Homo sapiens natural
resistance-associated
macrophage protein 2
(NRAMP2) gene,
exon 17, alternatively
spliced non-IRE
form, complete cds


8e-10


113671


!!!! ALU CLASS F WARNING
ENTRY !!!!


0.72
Nearest Neighbor (BlastN vs. Genbank)


Nearest Neighbor (BlastX vs. Non-Redundant Proteins)


SEQ
ID


ACCESSION


DESCRIPTION


P VALUE


ACCESSION


DESCRIPTION


P VALUE


1360 


AF002283


Mus musculus alpha-
actinin-2 associated
LIM protein mRNA,
alternatively spliced
product, complete cds


6e-10


2996196


(AF053367) carboxyl terminal
LIM domain protein [Mus
musculus]


4e-21


1361 


M26220


African green
monkey origin of
replication


5e-10


2143455


gene DMR-N9 protein - mouse
(fragment)


8.8


1362 


Z78006 


H.sapiens flow-sorted 
chromosome 6 
HindIII fragment,
SC6pA7F10


4e-10


2072977


(U93574) putative p150 [Homo
sapiens]


0.005


1363 


U82303 


Homo sapiens 
unknown protein 
mRNA, partial cds 


2e-10


1825711 


^JS8l83) similar to the 
immunoglobulin superfamily, 
most similar to nerual cell 
adhesion proteins 
[Caenorhabditis elegans]


0.031


1364


AF079764 


Drosophila 
melanogaster 
enhancer of 
polycomb 


2e-10


3757890


(AF079764) enhancer of
polycomb [Drosophila
melanogaster]


1e-10


1365 


L24123 


Homo sapiens NRF1
protein (NRF1)
mRNA,


2e-10


3004573


(AC004j2U> similar to NFE2-
related transcription factors;
similar to I48694
(PID:g2137676) [Homo
sapiens]


4e-53


1366 



M91454


Orangutan alpha-
globin gene duplicate
region.


1e-10


464239


NADH-UBIQUINONE
OXIDOREDUCTASE CHAIN
4 >gi|1085185|pir||S52968
NADH dehydrogenase chain 4 -
honeybee mitochondrion
(SGC4) >gi|552446 (L06178)
NADH dehydrogenase subunit 4
[Apis mellifera ligustica]


6.0


1367


D87117


House mouse;
Musculus domesticus
brain mRNA for
SAP102, complete
cds


6e-11


473912


(L31961) phosphoprotein [Mus
cookii]


2.2 


1368


AC001002


Homo sapiens
(subclone 2_h9 from
P1 H39) DNA
sequence


5e-11


<NONE>


<NONE>


<NONE>






Nearest Neighbor (BlastN vs. Genbank)


Nearest Neighbor (BlastX vs. Non-Redundant Proteins)


SEQ
ID


ACCESSION


DESCRIPTION


P VALUE


ACCESSION


DESCRIPTION


P VALUE


1369


AC001002


Homo sapiens
(subclone 2_h9 from
P1 H39) DNA
sequence


5e-11


<NONE>


<NONE>


<NONE>


1370


AB007874


Homo sapiens
KIAA0414 mRNA,
partial cds


5e-11


<NONE>


<NONE>


<NONE>


1371


AC001002


Homo sapiens
(subclone 2_h9 from
P1 H39) DNA
sequence


5e-11


<NONE>


<NONE>


<NONE>


1372


AC001002


Homo sapiens
(subclone 2_h9 from
P1 H39) DNA
sequence


5e-11


<NONE>


<NONE>


<NONE>


1373


AC001002


Homo sapiens
(subclone 2_h9 from
P1 H39) DNA
sequence


5e-11


<NONE>


<NONE>


<NONE>


1374


AC001002


Homo sapiens
(subclone 2_h9 from
P1 H39) DNA
sequence


5e-11


419481


gag polyprotein - human
endogenous virus S71


4.6


1375


Z21352


H.sapiens mRNA for
HERV-K long
terminal repeat


5e-11


2947238


(AF051782) diaphanous 1
[Homo sapiens]


2.8


1376


AB007928


Homo sapiens mRNA
for KIAA0459
protein, partial cds


5e-11


<NONE>


<NONE>


<NONE>


1377


D87117


House mouse;
Musculus domesticus
brain mRNA for
SAP102, complete
cds


5e-11


473912


(L31961) phosphoprotein [Mus
cookii]


1.8


1378


AJ131501


Homo sapiens DNA
sequence between
two AML1 gene
promoters, 6423 BP


5e-11


728831


!!!! ALU SUBFAMILY J
WARNING ENTRY


0.20


1379


M27826


Human endogenous
retroviral protease
mRNA, complete cds.


5e-11


8855S


retroviral proteinase-like protein
- human


0.002
Nearest Neighbor (BlastN vs. Genbank)


Nearest Neighbor (BlastX vs. Non-Redundant Proteins)


SEQ
ID


ACCESSION 


DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 
















1380 


U23804 


Drosophila
melanogaster putative
GTP-binding
regulatory protein
beta chain (GPB)
mRNA, partial cds.


5e-11


2494916


TRP-ASP REPEATS
CONTAINING PROTEIN
T10F2.4 IN CHROMOSOME
III protein; similar to G-Beta
repeat region (Trp-Asp
domains) of guanine nucleotide
binding protein


1e-30


1381 


Z22784


M.musculus troponin
I gene,


3e-11


3892202 


(AF072889) transcription 
repressor brain factor 2 


0.053 


1382 


AB007880 


Homo sapiens 
KIAA0420 mRNA, 
complete cds 


2e-11


<NONE> 


<NONE> 


<NONE> 


1383 


AF020361 


Homo sapiens BAX
gene, exon 6, partial
sequence


2e-11


<NONE> 


<NONE> 


<NONE> 


1384 


L35600 


Homo sapiens DNA 
sequence. 


2e-11


1174952


GLYCOPROTEIN D
PRECURSOR gD [Bovine
herpesvirus 1]


0.25 


1385 


U21943 


Human organic anion 

transporting 

polypeptide 


2e-11


2738223


(U95011) brain-specific organic
anion transporter


9e-19


1386 


U90878 


Homo sapiens 
carboxyl terminal 
LIM domain protein


2e-11


2996196


(AF053367) carboxyl terminal
LIM domain protein [Mus
musculus]


4e-23


1387 


U31929 


Human orphan 
nuclear receptor 
(DAX1) gene,
complete cds


6e-12


<NONE>


<NONE>


<NONE>


1388 


M25828 


Human von 
Willebrand factor 
gene, exons 1, 2, and
3, and three Alu
repetitive elements. 


6e-12 


<NONE> 


<NONE> 


<NONE> 


1389 


AB020648 


Homo sapiens mRNA
for KIAA0841
protein, partial cds


3e-12


<NONE> 


<NONE> 


<NONE> 


1390 


Z15026


H.sapiens genes for
tumor necrosis factor
(Tnfa) and
lymphotoxin (Tnfb)


2e-12


<NONE> 


<NONE> 


<NONE> 


1391 


L28101 


Homo sapiens 
kallistatin (PI4) gene, 
exons M. complete 
cds 


2e-12 


<NONE> 


<NONE> 


<NONE>


1392 


Z47046 


Human cosmid
OLL2C9 from Xq28


2e-12


<NONE>


<NONE> 


<NONE> 



Nearest Neighbor (BlastN vs. Genbank)


Nearest Neighbor (BlastX vs. Non-Redundant Proteins)


SEQ
ID


ACCESSION


DESCRIPTION


P VALUE


ACCESSION


DESCRIPTION


P VALUE


1393


Z79007


H.sapiens flow-sorted
chromosome 6
HindIII fragment,
SC6pA20E2


2e-12


106322


hypothetical protein (L1H 3'
region) - human


1.5


1394


U34377


Human tyrosine
kinase TXK (txk)
gene, exon 13,


1e-12


151484


(M55524) ORF 4; putative
[Pseudomonas aeruginosa]


4.3


1395


D70845


Mus musculus apg-1
gene for novel
member of heat shock
protein 110, promoter
region


1e-12


113658


ALKALINE PROTEINASE
PRECURSOR (ALP) precursor -
fungus (Acremonium
chrysogenum)


3.5


1396 


M63978


Human vascular
endothelial growth
factor gene, exon 8,


1e-12


3982737


(AF069731) calmodulin-
dependent protein kinase II beta
M isoform [Rattus norvegicus]


0.083


1397 


U60266 


Homo sapiens 
lysosomal alpha- 
mannosidase (manB) 
mRNA, complete cds 


8e-13


<NONE>


<NONE>


<NONE>


1398 


Z68297 


Caenorhabditis 
elegans cosmid
F11A10, complete
sequence
[Caenorhabditis
elegans]


7e-13


2393734


(AC002542) similar to C.
elegans F11A10.5; 80%
similarity to Z68297
(PID:g1130619) [Homo
sapiens]


5e-34


1399


Z68297


Caenorhabditis
elegans cosmid
F11A10, complete
sequence
[Caenorhabditis
elegans]


7e-13


2393734


(AC002542) similar to C.
elegans F11A10.5; 80%
similarity to Z68297
(PID:g1130619) [Homo
sapiens]


3eOS J 




1400


Z68885


Human DNA
sequence from
cosmid L21F12B,
Huntington's Disease
Region, chromosome
4p16.3, contains
EST.


6e-13


<NONE>


<NONE>


<NONE>


1401


X76104


H.sapiens DAP-
kinase mRNA


6e-13


2911154


(AB007143) ZIP-kinase [Mus
musculus]


0.007


1402


Z78668


H.sapiens flow-sorted
chromosome 6 TaqI
fragment,
SC6pA13G4


5e-13


106322


hypothetical protein (L1H 3'
region) - human


2e-06 


1403


L35600


Homo sapiens DNA
sequence.


3e-13


3184290


(AC004136) hypothetical
protein [Arabidopsis thaliana]


1.7
Nearest Neighbor (BlastN vs. Genbank)


Nearest Neighbor (BlastX vs. Non-Redundant Proteins)


SEQ
ID


ACCESSION 


DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 




1404


AF090452


Cloning vector
pKODT complete
sequence


2e-13


3876730


(Z49966) F35C11.4
[Caenorhabditis elegans]


7.8


1405 


D28126 


Human gene for ATP 
synthase alpha 
subunit, complete cds 
(exon 1 to 12)


2e-13 


419481 


gag polyprotein - human 
endogenous virus S71 


3.4 


1406 


AF005219 


Homo sapiens 
transcription factor 
HOXD13


2e-13


2822166


(AC004080) transcription factor
HOXA13 [Homo sapiens]


5e-09 


1407 


AB018301


Homo sapiens mRNA
for KIAA0758
protein, partial cds


2e-13


3882237


(AB018301) KIAA0758 protein
[Homo sapiens]


le^3 


1408 


D70845 


Mus musculus apg-1
gene for novel
member of heat shock
protein 110, promoter


1e-13


113658


ALKALINE PROTEINASE
PRECURSOR (ALP) precursor -
fungus (Acremonium
chrysogenum)


3.1 


1409 


AG000691


Homo sapiens
genomic DNA, 21q
region, clone:
T171BG33


8e-14


930045


(X15332) alpha-1 (III) collagen
[Homo sapiens]


3e-04


1410 


D30785 


Mouse mRNA for 
neuropsin, complete
cds


8e-14


3559978


(AJ005641) serine protease
[Rattus rattus]


2e-12 


1411 


U32710 


Haemophilus 
influenzae Rd section 
25 of 163 of the 
complete genome 


8e-14


4106673


(AL035064) queuine trna-
ribosyl transferase
[Schizosaccharomyces pombe]


2e-38 


1412 


AG00O3S6 


Homo sapiens 
genomic DNA, 21q 
region, clone:
64E11X19


7e-14


1363925


hypothetical protein 2 - North
American opossum (fragment)
>gi|897721 (Z48955) ORF-2,
putative RT [Didelphis
virginiana]


1.1


1413 


Z62664 


H.sapiens CpG DNA,
clone 71d11, forward
read cpg71d11.ft1a.


7e-14


3953461


(AC002328) F20N2.6
[Arabidopsis thaliana]


0.085


1414 


AB014532 


Homo sapiens mRNA 
for KIAA0632 
protein, partial cds 


7e-14 


113668 


!!!! ALU CLASS C WARNING
ENTRY !!!!


0.040 
Nearest Neighbor (BlastN vs. Genbank)


Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 


SEQ 
ID 


ACCESSION 


DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 
















1415 


Z96478 


H.sapiens telomeric
DNA sequence, clone
20PTEL004, read
20PTELOO004.seq


7e-14


2981631


(AB012223) ORF2 [Canis
familiaris] 


2e-04 


1416 


AF100694


Mus musculus
Pontin52 mRNA,
complete cds


4e-14


<NONE>


<NONE> 


<NONE> 


1417 


AF012899 


Sambucus nigra 
ribosome inactivating 
protein precursor 
mRNA, complete cds 


4e-14 


<NONE> 


<NONE> 


<NONE> 


1418 


AF033349 


Homo sapiens MLL 
gene breakpoint 
cluster region, intron
I, partial sequence 


3e-14 


728831 


ALU SUBFAMILY J 
WARNING ENTRY 


9.3


1419 


AC001526


Homo sapiens
(subclone 4_f6 from
P1 H54) DNA
sequence 


3e-14 


99861 


extensin - almond >gi|20420 
(X65718) extensin 


9.2


1420 


AF012899


Sambucus nigra 
ribosome inactivating 
protein precursor 
mRNA, complete cds 


3e-14 


728832 


!!!! ALU SUBFAMILY SB 
WARNING ENTRY 


0.15 


1421 


AF100694 


Mus musculus 
Pontin52 mRNA,
complete cds


2e-14


3913573 


EPHRIN-A2 PRECURSOR 
(EPH-RELATED RECEPTOR
TYROSINE KINASE LIGAND
6) (LERK-6) sapiens]
>gi|2924761 (AC004258)
EPL6_HUMAN [Homo sapiens]


8.7


1422 


AF012899


Sambucus nigra 
ribosome inactivating 
protein precursor 
mRNA, complete cds 


9e-15 


119040 


ANTIGEN (E1B 19K) 
>gi|74142|pir||QlAD25 early 
E1B 21K protein II - human 
adenovirus 5 >gi|58489 
(X02996) mRNA 5 first reading 
frame [Human adenovirus type
5] adenovirus type 5]
>gi|209797 (J01969) 21 kD
protein


1.5 
Nearest Neighbor (BlastN vs. Genbank)


Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 


SEQ 
ID 


ACCESSION 


DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 












1423


AF012899


Sambucus nigra
ribosome inactivating
protein precursor
mRNA, complete cds


8e-15


477102


transcription factor GATA-4,
retinoic acid-inducible - mouse
>gi|293345 (M98339) GATA-
binding transcription factor
[Mus musculus]


0.57 


1424 


AB012223


Canis familiaris LINE-
1 element ORF2
mRNA, complete cds


8e-15


92385 


hypothetical protein - rat 
(fragment) 


0.003 


1425 


AF100694 


Mus musculus
Pontin52 mRNA,
complete cds


3e-15


<NONE> 


<NONE> 


<NONE> 


1426 


X12433 


Human pHS 1-2 
mRNA with ORF 
homologous to 
membrane receptor 
proteins 


3e-15


422532 


collagen alpha 3(IV) chain - sea 
urchin 


8.9 


1427 


AF012899 


Sambucus nigra 
ribosome inactivating 
protein precursor 
mRNA, complete cds 


3e-15


1353143


PROBABLE NUCLEAR
HORMONE RECEPTOR
E02H1.7
>gi|3875431|gnl|PID|e1344980
(Z47075) similar to Zinc finger,
C4 type (two domains)
[Caenorhabditis elegans]


5.0 


1428 


Z69651 


Human DNA 
sequence from 
cosmid I 7SB9
Huntington's Disease
Region, chromosome
4p16.3


3e-15


403460 


(L24521) transformation-related 
protein [Homo sapiens] 


0.60 


1429 


AF012899 


Sambucus nigra 
ribosome inactivating 
protein precursor 
mRNA, complete cds 


2e-15


108750


Ig heavy chain precursor
(B/MT.4A.17.H5.A5) - bovine
>gi|440 (X62916) anti-
testosterone antibody [Bos
taurus]


1.1


1430 


X83299 


H.sapiens SMA3 
mRNA 


2e-15


671530


(X83299) SMA3 gene product
[Homo sapiens]


0.32 


1431 


U01877 


Human p300 protein 
mRNA, complete cds. 
> gb|I62297|I62297 
Sequence 1 from
patent US 5658784


2e-15


3024341


E1A-ASSOCIATED PROTEIN
P300 


0.019 
SEQ



Nearest Neighbor (BlastN vs. Genbank)



Nearest Neighbor (BlastX vs. Non-Redundant Proteins)



ID ACCESSION DESCRIPTION P VALUE ACCESSION



1432 X16516



1433 M74165



1434 X71893



DESCRIPTION P VALUE



Mouse MHC (Qa) Q2-
k gene for class I
antigen, exons 4-8 1e-15 2496897
Chicken tensin
mRNA, complete cds. 1e-15 283920



H.sapiens gene for
immunoglobulin
kappa light chain
variable region O4
and O5



9e-16



<NONE>



CHROMOSOME III
>gi|3874384|gnl|PID|e1344078
EST EMBL:C08256 comes
from this gene; cDNA EST
EMBL:C09941 comes from this
gene; cDNA EST yk340a10.3
comes from this gene; cDNA
EST yk340a10.5 comes from
this gene [Ca...

tensin - chicken >gi|212752
(M74165) tensin



7e-08



2e-19



1435 U05227



1436 M23404



Human Rar protein
mRNA, complete cds. 9e-16



3036779



Chicken erythrocyte
anion transport
protein (band3)
mRNA, complete cds



9e-16 



_ <NONE> I <NONE> I 

CTTO) match: multiple 1 

proteins; match: 000407 
Q12S29P22I27 P36861 
Q402I9; match. P70550 
Q4I022 P22I25 Q08155 
P352S6; match; P5114S P5U47J 
P35293 P36861 P352S9; match: 
P35284 Q40217P5U32 
P51I57 P51 158; match: Q41022I 3e-06 



726403 



1437 X16145



1438 AF012899



1439 AF076981



Rat mRNA for liver a-
L-Fucosidase (EC
3.2.1.51)



9e-16



67502



Sambucus nigra
ribosome inactivating
protein precursor
mRNA, complete cds 8e-16



Mus musculus brain
mitochondrial carrier
protein BMCP1
(Bmcp1) mRNA,
complete cds



<NONE>



8e-16 



3851540 



(U23175) similar to anion 
exchange protein 
[Caenorhabditis elegans]



alpha-L-fucosidase (EC
3.2.1.51) I precursor, tissue -
human >gi|178409 (M29877)
alpha-L-fucosidase precursor
(EC 3.2.1.5) [Homo sapiens]



1e-28



2e-29 



<NONE> 



<NONE>



(AF078544) brain mitochondrial
carrier protein-1 [Homo sapiens]



11 1 
Nearest Neighbor (BlastN vs. Genbank)


Nearest Neighbor (BlastX vs. Non-Redundant Proteins)


SEQ
ID


ACCESSION


DESCRIPTION


P VALUE


ACCESSION 


DESCRIPTION 


P VALUE 






1440


Z54349


H.sapiens MN/CA9
GENE


5e-16


728831


!!!! ALU SUBFAMILY J
WARNING ENTRY


0.002 


1441 


AF077003


Mus musculus SH3
domain-containing
adapter protein
mRNA, complete cds


3e-16 


309)23 


(M35526) complement 
component C5D [Mus 
musculus] 


3 J 


1442 


X64587


M.musculus mRNA
for splicing factor
U2AF (65kD)


3e-16


2143767


glycoprotein - rat >gi|986943
(L08134) glycoprotein [Rattus
norvegicus] norvegicus]


0.003 




1443


AB014561


Homo sapiens mRNA
for KIAA0661
protein, complete cds


3e-16


3327136


(AB014561) KIAA0661 protein
[Homo sapiens]


1e-20


1444 


Z73987


Human DNA
sequence from
cosmid N120B6 on
chromosome 22
Contains ESTs,
complete sequence
[Homo sapiens]


1e-16


<NONE> 


<NONE> 


<NONE> 


1445 


M5831S 


Homo sapiens ala 
gene. 


1e-16


<NONE> 


<NONE> 


<NONE> 


1446 


U44103 


Human small GTP 
binding protein Rab9 
mRNA, complete cds


1e-16


1552584


(Z80233) hypothetical protein
Rv0029


1.3 


1447 


AB014561


Homo sapiens mRNA
for KIAA0661
protein, complete cds


9e-17


3327136


(AB014561) KIAA0661 protein
[Homo sapiens]


2e-20






1448


AF100694


Mus musculus
Pontin52 mRNA,
complete cds


1e-17


<NONE> 


<NONE> 


<NONE> 


1449 


M76762 


Mus musculus 
ribosomal protein (Ke 
3) gene, exons 1 to 5,
and complete cds.


1e-17


1073048


pupR protein - Pseudomonas
putida >gi|525260


0.36 


1450 


D50561 


Human DNA, 
replication enhancing
element (REE1)


4e-18


126295


LINE-1 REVERSE
TRANSCRIPTASE 
HOMOLOG 


0.78 


1451 


D16431


Human mRNA for
hepatoma-derived
growth factor,
complete cds


4e-18


3242079


(AJ006984) proline-rich protein


0.0 IS 
Nearest Neighbor (BlastN vs. Genbank)


Nearest Neighbor (BlastX vs. Non-Redundant Proteins)


SEQ
ID


ACCESSION


DESCRIPTION


P VALUE


ACCESSION


DESCRIPTION


P VALUE


1452 


AF088983


Mus musculus heat
shock protein hsp40-3
mRNA, complete cds


4e-18


3873707


(Z73102) Similarity to B.subtilis
DNAJ protein
(SW:DNAJ_BACSU); cDNA
EST yk437a1 comes from this
gene [Caenorhabditis elegans]


9e-25


1453 


U60205


Human methyl sterol
oxidase (ERG25)
mRNA, complete cds


3e-18


<NONE>


<NONE>


<NONE>


1454 


AF038177


Homo sapiens clone
23899 mRNA
sequence


1e-18


1360775


G protein-coupled receptor 74 -
equine herpesvirus 2 >gi|695246 
(U20824) G protein-coupled 
receptor [Equine herpesvirus 2] 


5.1 


1455 


AB014561


Homo sapiens mRNA
for KIAA0661
protein, complete cds


1e-18


3327136 


(AB014561) KIAA0661 protein 


tc-.:l 


1456 


AB014561 


Homo sapiens mRNA 
for KIAA0661 
protein, complete cds 


1e-18


3327136


(AB014561) KIAA0661 protein
[Homo sapiens]


le-^2 


1457


U34374


Human tyrosine
kinase TXK (txk)
gene, exons 9 and 10.


1e-19


<NONE> 


<NONE> 


<NONE> 


1458


AB006969


Homo sapiens
hGAA1 mRNA,
complete cds


1e-19


4151809


(AF102855) synaptic SAPAP-
interacting protein Synamon


0.19


1459 


AB002293 


Human mRNA for
KIAA0295 gene,
partial cds


1e-19


2224531


(AB002293) KIAA0295 [Homo
sapiens]


6e-17


1460


Z59664


H.sapiens CpG DNA,
clone 168f9, reverse
read cpg168f9.rt1a


5e-20


3880251


(Z82055) predicted using
genefinder


6.5


1461


M73837


Human modulator
recognition factor 2
(MRF-2) mRNA,
complete cds.


5e-20


284313


modulator recognition factor 2 -
human factor 2 [Homo sapiens]


0.019








Nearest Neighbor (BlastN vs. Genbank)


Nearest Neighbor (BlastX vs. Non-Redundant Proteins)


SEQ
ID


ACCESSION


DESCRIPTION


P VALUE


ACCESSION


DESCRIPTION


P VALUE


1462 


U24267 


Human pyrroline-5-
carboxylate
dehydrogenase


5e-20 


2506350 


CARBOXYLATE 
DEHYDROGENASE 
PRECURSOR (P5C 
DEHYDROGENASE) 
>gi|I353248 sapiens] 
>sill3532Sft (Tl?d'7A7\ nvmliu 
5-carboxylate dehydrogenase 
[Homo sapiens] 
>gi|1589585|prf||2211355A
delta 1-pyrroline-5-carboxylate
dehydrogenase [Homo sapiens]


5e-04


1463 


U13262


Mus musculus myelin
gene expression
factor


4e-20


536926


(U13262) myelin gene
expression factor [Mus
musculus]


3e-07


1464


U13262


Mus musculus myelin
gene expression
factor


4e-20


3126878


(AF061332) M4 protein
deletion mutant [Homo sapiens]


1e-08


1465


Z61239


H.sapiens CpG DNA,
clone 48f10, forward
read cpg48f10.ft1a.


4e-20


1669601


(D88747) AR401 [Arabidopsis
thaliana]


8e-19


1466


U89915


Mus musculus
junctional adhesion
molecule (Jam)
mRNA, complete cds


1e-20


3462455


(U89915) junctional adhesion
molecule [Mus musculus]


7e-11


1467


AF029071


Gallus gallus p52 pro-
apototic protein
mRNA, complete cds


7e-22


2599492


(AF029071) p52 pro-apototic
protein [Gallus gallus]


1e-15


1468


M25636


Figure 4. Nucleotide
sequence of the
pKS36 1.797 kb
insert.


6e-22


1196398


(M21305) unknown protein
[Homo sapiens]


0.65


1469


AB020655


Homo sapiens mRNA
for KIAA0848
protein, complete cds


6e-22


4240325


(AB020725) KIAA0918 protein
[Homo sapiens]


1e-19



SEQ
ID



1470



1471



Nearest Neighbor (BlastN vs. Genbank) Nearest Neighbor (BlastX vs. Non-Redundant Proteins)



ACCESSION 



DESCRIPTION 



S80935 



AF053066 



chorionic
gonadotropin beta 1
(CG beta 1) subunit



Homo sapiens
microsatellite
D5S2926 sequence 



P VALUE I ACCESSION 



DESCRIPTION 

prululLaol.s ALPHA" 



5e-22 



115310 



1(IV) CHAIN PRECURSOR
>gi|84917|pir||A31893 collagen
alpha 1(IV) chain precursor -
fruit fly (Drosophila
melanogaster) melanogaster]
>gi|157078 (M96575) type IV
collagen pro-collagen
[Drosophila melanogaster]



2e-22



!!!! ALU SUBFAMILY J
728831 WARNING ENTRY



P VALUE 



0.027 



3e-04 



1472 



U55177 



Danio rerio carbonic 
anhydrase homolog 
CAH-Z mRNA,
complete cds 



2e-22 



1473 



AF064250 



Gallus gallus
ubiquitin specific 
protease 66 



3123190 



CARBONIC ANHYDRASE 
I (CARBONATE 

DEHYDRATASE) >gi|2576335 
(U55177) CAH-Z [Danio rerio]



5e-14 



2e-22 



2736064 



(ARH6107) ubiquitin specific 
protease 41 [Gallus



1474 



AF030880 



Homo sapiens 
pendrin (PDS) 
mRNA. complete cds 



1475 



AF100694



Mus musculus
Pontin52 mRNA, 
complete cds 



1476 



X57398 



Human mRNA for 
M5 protein 



1477 



AB010998



Rattus norvegicus
PAD-RH mRNA for
peptidylarginine
deiminase type I,
complete cds



2e-22 



729367 



DRA PROTEIN (DOWN-
REGULATED IN ADENOMA)
>gi|2135020|pir||A47456 down
regulated in adenoma (DRA) -
human >gi|291964 (L02785)
Nuclear localization signal at
AA 569-573, 576-580, 579-583
acidic transcr. activ. domain 620-
640; homeobox motif 653-676
[Homo sapiens]



7e-37 



4e-53 



6e-23 



<NONE> 



<NONE>



<NONE> 



|Pm5 protein - human 
3e~23 | J07350 >gi[i3352731gn!|PID[e3624t 



1e-04



2e-23



<NONE> 



<NONE> 



<NONE> 



1478 



D10871



Human h NAT allele

2 gene for
arylamine N-
acetyltransferase



(J04734) CDC6 protein 
lc ~ ~ y » 17 1200 [Saccharomyccs ccrcvisiae | 



9.8 



1479



D10871



Human h NAT allele 

2 gene for 
arylamine N- 
acetyltransferase 



2e-23 



171200 



(J04734) CDC6 protein 
[Saccharomyces cerevisiae]



S.3 



PCT/US00/18374 



ACCESSION DESCRIPTION P VALUE
Homo sapiens MLL
AF4 fusion protein



1480 AF024541 mRNA, partial cds 2e-23



Human AF-4 mRNA,
1481 L13773 complete cds. 2e-23

Mus musculus
Pontin52 mRNA,

1482 AF100694 complete cds 8e-24



Drosophila

melanogaster Rga and

Atu genes, complete
1483 U75467 cds



8e-24 



7e-24 



Mus musculus
Pontin52 mRNA,
1485 AF100694 complete cds



7e-24 



ACCESSION 



2136142



3063962 



<NONE>



Human 28S

1486 M11167 ribosomal RNA gene 2e-24
Mus musculus
Pontin52 mRNA,

1487 AF100694 complete cds 2e-24



Cloning vector
pAP3neo DNA,
1488 AB003468 complete sequence
Human mRNA of



trk



oncogene >::
gb|I96186|I96186
Sequence 23 from
1489 X03541 patent US 5734039



Homo sapiens
(subclone 2_g11 from
P1 H43) DNA
1490 L81652 sequence
Drosophila
melanogaster
strawberry notch
(sno) mRNA,
1491 U95760 complete cds



2e-24



2e-24 



Mus musculus
Pontin52 mRNA,
AF100694 complete cds



2e-24 



<NONE> 



1169643 



3875481 



987050 



225047 



2078282



2623773 



DESCRIPTION 



serine/proline-rich FEL protein, 
splice form 1 - human 



(AF031404) MLL-AF4 fusion
protein [Homo sapiens] 



(U75467) Atu [Drosophila 
melanogaster]



2e-37



NEUROPEPTIDES
PRECURSOR >gi|416208
(U03137) neuropeptide
precursor FMRFamide-related
peptide [Lymnaea stagnalis]



(Z81054) predicted using
Genefinder; Similarity to UDP-
glucoronosyltransferases 5.1



USPl PROTEIN PRECURSOR 
>gi\ 169623 J 12 



(X65335) lacZ gene product 
[unidentified cloning vector]



(M74509) [Human endogenous 
retrovirus type C oncovirus 
sequence.], gene product [Homol 
sa P'"*ns) ]_ ^04 



reverse transcriptase related 
protein [Homo sapiens] 



(U95760) Sno [Drosophila 
melanogaster]



(AF004835) tyrocidine
synthetase 3 [Brevibacillus 
brevis] 



SEQ

ID ACCESSION



DESCRIPTION 



Homo sapiens mRNA 
for LAK-4p, 
1493 AB002405 complete cds

Human mRNA from 



chromosome 15 gene 
with homology to 
MHC-HLA-SB-1
1494 K03002 intron A

Human tubulin-
folding cofactor E
1495 U61232 mRNA, complete cds



Arabidopsis thaliana
Col-0 putative RNA
helicase A mRNA,
1496 U10245 complete cds.



H.sapiens DNA for
endogenous retroviral
1497 X89211 like element



P VALUE I ACCESSION 



8e-25 



2496822 



DESCRIPTION

ICATTJ5 
PROTEIN B0416.1 IN
CHROMOSOME X >gi|746502
(U23516) B0416.1 gene product
[Caenorhabditis elegans] 9e-11



8e-25



7e-25



1514614 



1465772 



(X92842) nuclear protein [Mus
musculus] 1e-13



(U61232) cofactor E [Homo
sapiens]



2e-05 



5e-25 



1353239 



(U10245) putative RNA 
helicase A [Arabidopsis
thaliana] 



1e-37



Homo sapiens
(subclone 2_g11 from
P1 H43) DNA
1498 L81652 sequence



H.sapiens mRNA for
1499 X82895 DLG2



Mouse homeo box
2.6 (Hox-2.6) mRNA,
1500 M36654 complete cds



Mus musculus (clone
pMLZ-1) zinc finger
1501 L36315 protein



Homo sapiens mRNA
for KIAA0738
1502 AB018281 protein, complete cds



Homo sapiens
putative transcription

1503 AF017433 factor CR53



3e-25 



(Y12713) Pro-Pol-dUTPase
2065210 polyprotein



5e-06



3e-25 



2e-25 



(U93568) putative p150 [Homo
2072961 sapiens] 5e-16

MAGUK P55 SUBFAMILY
MEMBER 2 (MPP2 PROTEIN)
(DISCS, LARGE HOMOLOG
2497511 2)



9e-26 



3323169 



(AE001255) T. pallidum 
predicted coding region TP0854 1.9



9e-26



(Z67747) zinc finger protein
1806134 [Mus musculus]



4e-05 



!!!! ALU SUBFAMILY J 
9e-26 728831 WARNING ENTRY



1e-07



9e-26



3219985 



ZINC FINGER PROTEIN ZFP-29



SEQ
ID


Nearest Neighbor (BlastN vs. Genbank)
ACCESSION DESCRIPTION P VALUE


Nearest Neighbor (BlastX vs. Non-Redundant Proteins)
ACCESSION DESCRIPTION P VALUE


1504


AC001225


Homo sapiens
(subclone c6 from
BAC H94) DNA
sequence


8e-26


2653713


(U91823) small S protein 
[Hepatitis B virus] 


4.3


AF100694


Mus musculus
Pontin52 mRNA,
complete cds


8e-26 


283446


cyteine-rich surface antigen 72, 
CRP72 - Giardia lamblia 
(fragment) 


3.4


1506 


X94912


IH ^anif»n« Pr7"7 daha 




728837


!!!! ALU SUBFAMILY SQ 
WARNING ENTRY 


4e-09


1507 


AF100694


Mus musculus
Pontin52 mRNA,
complete cds


2e-26


<NONE>


<NONE>


<NONE>


1508 


U44103


Human small GTP 
binding protein Rab9 
mRNA. complete cds 


1e-26


3327038


(AB014512) KIAA0612 protein
[Homo sapiens]


8.7


1509 


AF100694


Mus musculus
Pontin52 mRNA,
complete cds 


9e-27 


4056454 


(AC005990) Contains repeated
region with similarity to
gb|U43627 extensin (atExt1)
gene from Arabidopsis thaliana.
ESTs gb|Z34165 and gb|Z18788
come from this gene.
[Arabidopsis thaliana]


0.14


1510


AG001212


Homo sapiens
genomic DNA, 21q
region, clone:
9H11N46


9e-27


126296 


LINE-1 REVERSE
TRANSCRIPTASE
HOMOLOG protein
[Nycticebus coucang]


0.012


1511


AF027131 


Mus musculus mucin 
glycoprotein MUC3
mRNA, partial cds


9e-27


2589172 


(U76551) mucin Muc3 [Rattus
norvegicus]


2e-14


1512


Rattus norvegicus
CTD-binding SR-like
protein rA9 mRNA,
U49057 complete cds


5e-27


1438534


(U49057) rA9 [Rattus
norvegicus]


1e-04


1513


Human, plasminogen
activator inhibitor-1
J03764 gene, exons 2 to 9,


3e-27


<NONE> 


<NONE>


<NONE>


1514


Z78160


M.musculus partial
cochlear mRNA
(clone 28D2)


3e-27


1490362


(Z78160) unknown [Mus
musculus]


2e-05


1515


Z64210


H.sapiens CpG DNA,
clone 99b4, reverse
read cpg99b4.rt1a.


3e-27


2257538


(AB004538) LIPOIC ACID
SYNTHETASE
PRECURSOR (LIP-SYN)
[Schizosaccharomyces pombe]


1e-06



Nearest Neighbor (BlastN vs. Genbank)


Nearest Neighbor (BlastX vs. Non-Redundant Proteins)


SEQ 
ID 


ACCESSION 


DESCRIPTION


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 






Homo sapiens 










1516 


L35659 


(subclone H8 6J16 
from P1 35H5C8)
DNA sequence.


1e-27


<NONE> 


<NONE> 


<NONE> 


1517 


AF100694


Mus musculus
Pontin52 mRNA,
complete cds


1e-27


1644471 


(U72686) odorant receptor 4 
[Danio rerio]


15 


1518 


AF100694 


Mus musculus 
Pontin52 mRNA, 
complete cds 


1e-27


2738388 


(AF003534) hypothetical 
protein 004L [Chilo iridescent 
virus] 


6.7 


1519 


AB009271 


Homo sapiens gene 
for BCNT, partial cds


1e-27


3880909 


(AL032636) Y40B1B.3
[Caenorhabditis elegans] 


4.6 


1520 


AF100694


Mus musculus
Pontin52 mRNA,
complete cds


1e-27


2133579 


spermatophorin Sp23 - yellow 
mealworm molitor] 


0.85 


1521 


AF100694


Mus musculus
Pontin52 mRNA,
complete cds


1e-27


121805 


ENDOGLUCANASE A 
PRECURSOR 


0.58 


1522 


AF100694 


Mus musculus 
Pontin52 mRNA,
complete cds


1e-27


3722000


(AF035323) survival motor
neuron protein [Bos taurus]


0.10


1523 


AF100694 


Mus musculus 
Pontin52 mRNA, 
complete cds 


1e-27


3328188 


(AF074902) laminin alpha chain 
[Caenorhabditis elegans]


0.083 


1524 


AF074382 


Homo sapiens IkB 
kinase gamma subunit


1e-27


3641280 


(AF074382) IkB kinase gamma 
subunit [Homo sapiens] 


0.041 


1525 


AF100694


Mus musculus
Pontin52 mRNA,
complete cds


1e-27


4056454


(AC005990) Contains repeated
region with similarity to
gb|U43627 extensin (atExt1)
gene from Arabidopsis thaliana.
ESTs gb|Z34165 and gb|Z18788
come from this gene.
[Arabidopsis thaliana]


6e-04 




L78778


Homo sapiens
(subclone 2_e10 from
P1 H49) DNA
sequence


1e-27


225047 


reverse transcriptase related 
protein [Homo sapiens]


2e-09 


1527 


L03427


Human zinc finger
protein basonuclin
mRNA, complete cds.


1e-27


1488275


(U59694) zinc finger protein
basonuclin [Homo sapiens]


9e-22


1528


U09954


Human ribosomal
protein L9 gene, 5'
region and complete
cds.


4e-28


2257538


(AB004538) LIPOIC ACID
SYNTHETASE
PRECURSOR (LIP-SYN)
[Schizosaccharomyces pombe]


2e-04
ACCESSION DESCRIPTION P VALUE



H.sapiens CpG DNA,
clone 99b4, reverse
Z64210 read cpg99b4.rt1a. 4e-28



Danio rerio carbonic
anhydrase homolog
CAH-Z mRNA,
complete cds




ACCESSION 



DESCRIPTION 



3878570 



synthase; cDNA EST yk283b6.3
comes from this gene; cDNA
EST yk283b6.5 comes from this
gene; cDNA EST yk472f5.3
comes from this gene; cDNA
EST yk472f5.5 comes from this
gene; cDNA EST yk476e7.3...
CARBONIC ANHYDRASE



7e-11



Human mRNA for
very-long-chain acyl- 
CoA dehydrogenase 
(VLCAD), complete 
cds 



ACYL-COA 
DEHYDROGENASE, VERY- 
LONG-CHAIN SPECIFIC 
PRECURSOR (VLCAD)
>gi|950358 taurus]



3e-27 



Homo sapiens 
survival motor neuron
pseudogene, complete
AF016591 sequence 3e-28



Mus musculus
Pontin52 mRNA,
AF100694 complete cds



728831 



!!!! ALU SUBFAMILY J
WARNING ENTRY



2e-28 



!!!! ALU SUBFAMILY SB 
728832 WARNING ENTRY



Mus musculus 
Pontin52 mRNA, 
AF100694 complete cds



2e-28 



DEHYDRIN DHN3
>gi|100035|pir||S18139 dehydrin
DHN3 - garden pea >gi|20709
(X63063) pea dehydrin DHN3
[Pisum sativum]



Mus musculus
Pontin52 mRNA,
AF100694 complete cds



2e-28



Mus musculus 
Pontin52 mRNA, 
AF100694 complete cds



2e-28



4056454 



FMRFAMIDE-RELATED

NEUROPEPTIDES
PRECURSOR >gi|416208
(U03137) neuropeptide
precursor FMRFamide-related
peptide [Lymnaea stagnalis]
(AC005990) Contains repeated
region with similarity to
gb|U43627 extensin (atExt1)
gene from Arabidopsis thaliana.
ESTs gb|Z34165 and gb|Z18788
come from this gene.
[Arabidopsis thaliana]



0.004 



6e-04 



9e-05 








Nearest Neighbor (BlastN vs. Genbank) 


Nearest Neighbor (BlastX vs. Non-Redundant Proteins)


SEQ 
ID 


ACCESSION 


DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 












(AC005990) Contains repeated


1537


AF100694


Mus musculus 
Pontin52 mRNA, 
complete cds 


2e-28 


4056454 


region with similarity to 
gb|U43627 extensin (atExt1)
gene from Arabidopsis thaliana.
ESTs gb|Z34165 and gb|Z18788
come from this gene. 
[Arabidopsis thaliana] 


2e-06 


1538 


AF100694


Mus musculus
Pontin52 mRNA,
complete cds


2e-28


4056454


(AC005990) Contains repeated
region with similarity to
gb|U43627 extensin (atExt1)
gene from Arabidopsis thaliana.
ESTs gb|Z34165 and gb|Z18788
come from this gene.
[Arabidopsis thaliana]


2e-09 


1539 


AF100694


Mus musculus
Pontin52 mRNA,
complete cds


2e-28


4056454


(AC005990) Contains repeated
region with similarity to
gb|U43627 extensin (atExt1)
gene from Arabidopsis thaliana.
ESTs gb|Z34165 and gb|Z18788
come from this gene.
[Arabidopsis thaliana]


le-09 


1540 


AF100694 


Mus musculus 
Pontin52 mRNA,
complete cds 


2e-28 


4056454 


(AC005990) Contains repeated
region with similarity to
gb|U43627 extensin (atExt1)
gene from Arabidopsis thaliana.
ESTs gb|Z34165 and gb|Z18788
come from this gene.
[Arabidopsis thaliana]


5e-10 


1541 


AF100694


Mus musculus
Pontin52 mRNA,
complete cds 


2e-28 


4056454 


(AC005990) Contains repeated
region with similarity to
gb|U43627 extensin (atExt1)
gene from Arabidopsis thaliana.
ESTs gb|Z34165 and gb|Z18788
come from this gene.
[Arabidopsis thaliana]


1e-11


1542 


AF100694


Mus musculus
Pontin52 mRNA,
complete cds 


2e-28 


3157926 


(AC002131) Strong similarity to
extensin-like protein gb|Z34465
from Zea mays. [Arabidopsis
thaliana] 


8e-12 


1543 


AF100694


Mus musculus
Pontin52 mRNA,
complete cds


1e-28


<NONE>


<NONE> 


<NONE> 


1544 


AF100694


Mus musculus
Pontin52 mRNA,
complete cds


1e-28


<NONE> 


<NONE> 


<NONE> 


1545 


AF100694 


Mus musculus 
Pontin52 mRNA, 
complete cds 


1e-28


<NONE> 


<NONE> 


<NONE> 



Nearest Neighbor (BlastN vs. Genbank)


Nearest Neighbor (BlastX vs. Non-Redundant Proteins)


SEQ
ID


ACCESSION


DESCRIPTION


P VALUE


ACCESSION 


DESCRIPTION 


P VALUE 






Mus musculus 










1546


AF100694


Pontin52 mRNA,
complete cds


1e-28


<NONE>


<NONE>


<NONE>


1547


AF100694


Mus musculus
Pontin52 mRNA,
complete cds


1e-28


<NONE> 


<NONE> 


<NONE> 


1548


AF100694


Mus musculus
Pontin52 mRNA,
complete cds


1e-28


<NONE> 


<NONE> 


<NONE> 


1549


AF100694


Mus musculus
Pontin52 mRNA,
complete cds


1e-28


<NONE>


<NONE> 


<NONE> 


1550


AF100694


Mus musculus
Pontin52 mRNA,
complete cds


1e-28


<NONE>


<NONE>


<NONE> 


1551


AF100694


Mus musculus
Pontin52 mRNA,
complete cds


1e-28


<NONE> 


<NONE> 


<NONE> 


1552


AF100694


Mus musculus 
Pontin52 mRNA, 
complete cds 


1e-28


<NONE> 


<NONE> 


<NONE>


1553


AF100694


Mus musculus
Pontin52 mRNA,
complete cds


1e-28


<NONE>


<NONE>


<NONE> 


1554


AF100694


Mus musculus
Pontin52 mRNA,
complete cds


1e-28


<NONE> 


<NONE> 


<NONE> 




AF100694


Mus musculus
Pontin52 mRNA,
complete cds


1e-28


<NONE> 


<NONE> 


<NONE> 


1556


AF100694


Mus musculus
Pontin52 mRNA,
complete cds


1e-28


<NONE>


<NONE> 


<NONE> 


1557


AF100694


Mus musculus
Pontin52 mRNA,
complete cds


1e-28


<NONE> 


<NONE> 


<NONE> 


1558 


AF100694


Mus musculus
Pontin52 mRNA,
complete cds


1e-28


<NONE> 


<NONE> 


<NONE> 


1559 


AF100694 


Mus musculus 
Pontin52 mRNA, 
complete cds 


1e-28




<NONE> 


<NONE> 


1560 


AF100694


Mus musculus
Pontin52 mRNA,
complete cds


1e-28


<NONE>


<NONE>


<NONE>


1561 


AF100694


Mus musculus
Pontin52 mRNA,
complete cds


1e-28


<NONE>


<NONE> 


<NONE> 



Nearest Neighbor (BlastN vs. Genbank)


Nearest Neighbor (BlastX vs. Non-Redundant Proteins)


SEQ
ID


ACCESSION


DESCRIPTION


P VALUE


ACCESSION


DESCRIPTION


P VALUE


1562


AF100694


Mus musculus
Pontin52 mRNA,
complete cds


1e-28


<NONE>


<NONE>


<NONE> 


1563 


AF100694 


Mus musculus
Pontin52 mRNA,
complete cds


1e-28


<NONE>


<NONE>


<NONE>


1564 


AF100694


Mus musculus
Pontin52 mRNA,
complete cds


1e-28


<NONE>


<NONE> 


<NONE> 


1565 


AF100694 


Mus musculus
Pontin52 mRNA,
complete cds


1e-28


<NONE>


<NONE> 


<NONE> 


1566 


M87708 


Human simple repeat 
polymorphism. 


1e-28


<NONE> 


<NONE> 


<NONE> 


1567 


AF100694 


Mus musculus
Pontin52 mRNA,
complete cds


1e-28


<NONE> 


<NONE> 


<NONE> 


1568


AF100694


Mus musculus
Pontin52 mRNA,
complete cds


1e-28


3924779


PTCPctctotj mtttm iu ramnrni 
B; cDNA EST yk450d8.5 comes
from this gene; cDNA EST
yk249a6.5 comes from this
gene; cDNA EST yk219a2.5
comes from this gene; cDNA
EST yk355e4.5 comes from this
gene; cDNA EST yk224f4.5
comes fr...

>gi|3924881|gnl|PID|e1354569
from this gene; cDNA EST
yk249a6.5 comes from this
gene; cDNA EST yk219a2.5
comes from this gene; cDNA
EST yk355e4.5 comes from this
gene; cDNA EST yk224f4.5
comes from...


3.0 


1569


AF100694


Mus musculus
Pontin52 mRNA,
complete cds


1e-28


1169643


FMRFAMIDE-RELATED
NEUROPEPTIDES
PRECURSOR >gi|416208
(U03137) neuropeptide
precursor FMRFamide-related
peptide [Lymnaea stagnalis]


0.66 



SEQ
ID


Nearest Neighbor (BlastN vs. Genbank)
ACCESSION DESCRIPTION P VALUE


Nearest Neighbor (BlastX vs. Non-Redundant Proteins)
ACCESSION DESCRIPTION P VALUE


1570 


AF100694


Mus musculus
Pontin52 mRNA,
complete cds


1e-28


3924779


B; cDNA EST yk450d8.5 comes
from this gene; cDNA EST
yk249a6.5 comes from this
gene; cDNA EST yk219a2.5
comes from this gene; cDNA
EST yk355e4.5 comes from this
gene; cDNA EST yk224f4.5
comes fr...

>gi|3924881|gnl|PID|e1354569
from this gene; cDNA EST
yk249a6.5 comes from this
gene; cDNA EST yk219a2.5
comes from this gene; cDNA
EST yk355e4.5 comes from this
gene; cDNA EST yk224f4.5
comes from...


0.65 


1571 


AF100694


Mus musculus
Pontin52 mRNA,
complete cds


1e-28


2133579


spermatophorin Sp23 - yellow
mealworm molitor]


0.49 


1572 


AF100694


Mus musculus
Pontin52 mRNA,
complete cds


1e-28


2133579 


spermatophorin Sp23 - yellow 
mealworm molitor] 


0.49 


1573 


AF100694


Mus musculus
Pontin52 mRNA,
complete cds


1e-28


283446 


cyteine-rich surface antigen 72, 
CRP72 - Giardia lamblia 
(fragment) 


0.45 


1574 


AF100694 


Mus musculus 
Pontin52 mRNA,
complete cds


1e-28


2498937


SPERMATOPHORIN SP23
PRECURSOR mealworm
>gi|161725 (M92928) structural
protein 


0.33 


1575 


AF100694


Mus musculus
Pontin52 mRNA,
complete cds


1e-28


1492050


(U60315) MC107L [Molluscum
contagiosum virus subtype 1]


0.18


1576 


AF100694


Mus musculus
Pontin52 mRNA,
complete cds


1e-28


2133579


spermatophorin Sp23 - yellow
mealworm molitor]


0.088


1577 


AF100694


Mus musculus
Pontin52 mRNA,
complete cds


1e-28


118588


DEHYDRIN DHN3
>gi|100035|pir||S18139 dehydrin
DHN3 - garden pea >gi|20709
(X63063) pea dehydrin DHN3
[Pisum sativum]


0.018


1578


AF100694


Mus musculus
Pontin52 mRNA,
complete cds


1e-28


118588


DEHYDRIN DHN3
>gi|100035|pir||S18139 dehydrin
DHN3 - garden pea >gi|20709
(X63063) pea dehydrin DHN3
[Pisum sativum]


0.016 




SEQ
ID


Nearest Neighbor (BlastN vs. Genbank)
ACCESSION DESCRIPTION P VALUE


Nearest Neighbor (BlastX vs. Non-Redundant Proteins)
ACCESSION DESCRIPTION P VALUE


1579 


AF100694


Mus musculus
Pontin52 mRNA,
complete cds


1e-28


118588


DEHYDRIN DHN3

>gi|100035|pir||S18139 dehydrin
DHN3 - garden pea >gi|20709
(X63063) pea dehydrin DHN3


0.012 


1580 


AF100694


Mus musculus
Pontin52 mRNA,
complete cds


1e-28


4056454


(AC005990) Contains repeated
region with similarity to
gb|U43627 extensin (atExt1)
gene from Arabidopsis thaliana.
ESTs gb|Z34165 and gb|Z18788
come from this gene.
[Arabidopsis thaliana]


0.010


1581 


AF100694


Mus musculus
Pontin52 mRNA,
complete cds


1e-28


118588


DEHYDRIN DHN3

>gi|100035|pir||S18139 dehydrin
DHN3 - garden pea >gi|20709
(X63063) pea dehydrin DHN3
[Pisum sativum]


0.002 


1582 


AF100694


Mus musculus
Pontin52 mRNA,
complete cds


1e-28


1169643


FMRFAMIDE-RELATED
NEUROPEPTIDES
PRECURSOR >gi|416208
(U03137) neuropeptide
precursor FMRFamide-related
peptide [Lymnaea stagnalis]


0.002 


1583 


AF100694


Mus musculus
Pontin52 mRNA,
complete cds


1e-28


4056454


(AC005990) Contains repeated
region with similarity to
gb|U43627 extensin (atExt1)
gene from Arabidopsis thaliana.
ESTs gb|Z34165 and gb|Z18788
come from this gene.
[Arabidopsis thaliana]


0.002 


1584 


AF100694 


Mus musculus
Pontin52 mRNA,
complete cds


1e-28


118588


DEHYDRIN DHN3
>gi|100035|pir||S18139 dehydrin
DHN3 - garden pea >gi|20709
(X63063) pea dehydrin DHN3
[Pisum sativum]


0.002 


1585 


AF100694


Mus musculus
Pontin52 mRNA,
complete cds


1e-28


4056454


(AC005990) Contains repeated
region with similarity to
gb|U43627 extensin (atExt1)
gene from Arabidopsis thaliana.
ESTs gb|Z34165 and gb|Z18788
come from this gene.
[Arabidopsis thaliana]


aoo: 


1586 


AF100694


Mus musculus
Pontin52 mRNA,
complete cds


1e-28


118588


DEHYDRIN DHN3
>gi|100035|pir||S18139 dehydrin
DHN3 - garden pea >gi|20709
(X63063) pea dehydrin DHN3
[Pisum sativum]


0.001 
Nearest Neighbor (BlastN vs. Genbank)


Nearest Neighbor (BlastX vs. Non-Redundant Proteins)


SEQ 
ID 


ACCESSION 


DESCRIPTION 


P VALUE


ACCESSION 


DESCRIPTION 


P VALUE 












(AC005990) Contains repeated




1587 


AF100694


Mus musculus
Pontin52 mRNA,
complete cds


1e-28


4056454


region with similarity to
gb|U43627 extensin (atExt1)
gene from Arabidopsis thaliana.
ESTs gb|Z34165 and gb|Z18788
come from this gene.
[Arabidopsis thaliana]


0.001


1588 


AF100694 


Mus musculus
Pontin52 mRNA,
complete cds


1e-28


4056454


(AC005990) Contains repeated
region with similarity to
gb|U43627 extensin (atExt1)
gene from Arabidopsis thaliana.
ESTs gb|Z34165 and gb|Z18788
come from this gene.
[Arabidopsis thaliana]


6e-04 


1589 


AF100694 


Mus musculus
Pontin52 mRNA,
complete cds


1e-28


4056454


(AC005990) Contains repeated
region with similarity to
gb|U43627 extensin (atExt1)
gene from Arabidopsis thaliana.
ESTs gb|Z34165 and gb|Z18788
come from this gene.
[Arabidopsis thaliana]


5e-04 


1590 


AF100694


Mus musculus
Pontin52 mRNA,
complete cds


1e-28


4056454


(AC005990) Contains repeated
region with similarity to
gb|U43627 extensin (atExt1)
gene from Arabidopsis thaliana.
ESTs gb|Z34165 and gb|Z18788
come from this gene.
[Arabidopsis thaliana]


5e-04 


1591 


AF100694


Mus musculus
Pontin52 mRNA,
complete cds


1e-28


118588


DEHYDRIN DHN3
>gi|100035|pir||S18139 dehydrin
DHN3 - garden pea >gi|20709
(X63063) pea dehydrin DHN3
[Pisum sativum]


2e-04 


1592 


AF100694 


Mus musculus
Pontin52 mRNA,
complete cds


1e-28


4056454


(AC005990) Contains repeated
region with similarity to
gb|U43627 extensin (atExt1)
gene from Arabidopsis thaliana.
ESTs gb|Z34165 and gb|Z18788
come from this gene.
[Arabidopsis thaliana]


2e-04 


1593 


AF100694


Mus musculus
Pontin52 mRNA,
complete cds


1e-28


4056454


(AC005990) Contains repeated
region with similarity to
gb|U43627 extensin (atExt1)
gene from Arabidopsis thaliana.
ESTs gb|Z34165 and gb|Z18788
come from this gene.
[Arabidopsis thaliana]


5e-05 



SEQ
ID


Nearest Neighbor (BlastN vs. Genbank)
ACCESSION DESCRIPTION P VALUE


Nearest Neighbor (BlastX vs. Non-Redundant Proteins)
ACCESSION DESCRIPTION P VALUE


1594 


AF100694 


Mus musculus 
Pontin52 mRNA,
complete cds


1e-28


4056454


(AC005990) Contains repeated
region with similarity to
gb|U43627 extensin (atExt1)
gene from Arabidopsis thaliana.
ESTs gb|Z34165 and gb|Z18788
come from this gene.
[Arabidopsis thaliana]


5e-05 


1595 


AF100694 


Mus musculus 
Pontin52 mRNA,
complete cds


1e-28


4056454


(AC005990) Contains repeated
region with similarity to
gb|U43627 extensin (atExt1)
gene from Arabidopsis thaliana.
ESTs gb|Z34165 and gb|Z18788
come from this gene.
[Arabidopsis thaliana]


1e-05


1596 


AF100694 


Mus musculus 
Pontin52 mRNA,
complete cds


1e-28


4056454


(AC005990) Contains repeated
region with similarity to
gb|U43627 extensin (atExt1)
gene from Arabidopsis thaliana.
ESTs gb|Z34165 and gb|Z18788
come from this gene.
[Arabidopsis thaliana]


1e-05


1597 


AF100694 


Mus musculus 
Pontin52 mRNA, 
complete cds 


1e-28


4056454


(AC005990) Contains repeated
region with similarity to
gb|U43627 extensin (atExt1)
gene from Arabidopsis thaliana.
ESTs gb|Z34165 and gb|Z18788
come from this gene.
[Arabidopsis thaliana]


9e-06 


1598 


AF100694 


Mus musculus 
Pontin52 mRNA, 
complete cds


1e-28


4056454


(AC005990) Contains repeated
region with similarity to
gb|U43627 extensin (atExt1)
gene from Arabidopsis thaliana.
ESTs gb|Z34165 and gb|Z18788
come from this gene.
[Arabidopsis thaliana]


6e-06 


1599 


AF100694


Mus musculus
Pontin52 mRNA,
complete cds


1e-28


4056454


(AC005990) Contains repeated
region with similarity to
gb|U43627 extensin (atExt1)
gene from Arabidopsis thaliana.
ESTs gb|Z34165 and gb|Z18788
come from this gene.
[Arabidopsis thaliana]


5e-06 


1600 


AF100694


Mus musculus
Pontin52 mRNA,
complete cds


1e-28


544357


RNA BINDING PROTEIN
FUS/TLS protein [human,
peptide, 526 aa] [Homo sapiens]


4e-06 



Nearest Neighbor (BlastN vs. Genbank)


Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 


SEQ 
ID 


ACCESSION 


DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 












(AC005990) Contains repeated




1601 


AF100694 


Mus musculus 
Pontin52 mRNA,
complete cds


1e-28


4056454


region with similarity to
gb|U43627 extensin (atExt1)
gene from Arabidopsis thaliana.
ESTs gb|Z34165 and gb|Z18788
come from this gene.
[Arabidopsis thaliana] 


2e-06 


1602 


AF100694


Mus musculus
Pontin52 mRNA,
complete cds


1e-28


4056454


(AC005990) Contains repeated
region with similarity to
gb|U43627 extensin (atExt1)
gene from Arabidopsis thaliana.
ESTs gb|Z34165 and gb|Z18788
come from this gene.
[Arabidopsis thaliana]


2e-06 


1603 


AF100694 


Mus musculus 
Pontin52 mRNA, 
complete cds 


1e-28


4056454


(AC005990) Contains repeated
region with similarity to
gb|U43627 extensin (atExt1)
gene from Arabidopsis thaliana.
ESTs gb|Z34165 and gb|Z18788
come from this gene. 
[Arabidopsis thaliana] 


9e-07 


1604 


AF100694 


Mus musculus 
Pontin52 mRNA, 
complete cds 


1e-28


4056454


(AC005990) Contains repeated
region with similarity to
gb|U43627 extensin (atExt1)
gene from Arabidopsis thaliana.
ESTs gb|Z34165 and gb|Z18788
come from this gene.
[Arabidopsis thaliana]


8e-07 


1605 


AF100694


Mus musculus 
Pontin52 mRNA, 
complete cds 




1169643


FMRFAMIDE-RELATED
NEUROPEPTIDES
PRECURSOR >gi|416208
(U03137) neuropeptide
precursor FMRFamide-related
peptide [Lymnaea stagnalis]


7e-07 


1606 


AF100694 


Mus musculus 
Pontin52 mRNA,
complete cds


1e-28


4056454


(AC005990) Contains repeated
region with similarity to
gb|U43627 extensin (atExt1)
gene from Arabidopsis thaliana.
ESTs gb|Z34165 and gb|Z18788
come from this gene.
[Arabidopsis thaliana]


6e-07


1607 


AF100694


Mus musculus
Pontin52 mRNA,
complete cds


1e-28


4056454


(AC005990) Contains repeated
region with similarity to
gb|U43627 extensin (atExt1)
gene from Arabidopsis thaliana.
ESTs gb|Z34165 and gb|Z18788
come from this gene.
[Arabidopsis thaliana]


5e-07



Nearest Neighbor (BlastN vs. Genbank)


Nearest Neighbor (BlastX vs. Non-Redundant Proteins)


SEQ 
ID


ACCESSION 


DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 












(AC005990) Contains repeated




1608 


AF100694 


Mus musculus 
Pontin52 mRNA, 
complete cds 


1e-28


4056454


region with similarity to
gb|U43627 extensin (atExt1)
gene from Arabidopsis thaliana.
ESTs gb|Z34165 and gb|Z18788
come from this gene.
[Arabidopsis thaliana]


3e-07 


1609


AF100694


Mus musculus
Pontin52 mRNA,
complete cds


1e-28


4056454


(AC005990) Contains repeated
region with similarity to
gb|U43627 extensin (atExt1)
gene from Arabidopsis thaliana.
ESTs gb|Z34165 and gb|Z18788
come from this gene.
[Arabidopsis thaliana]


1e-07


1610 


AF100694 


Mus musculus 
Pontin52 mRNA, 
complete cds 


1e-28


4056454


(AC005990) Contains repeated
region with similarity to
gb|U43627 extensin (atExt1)
gene from Arabidopsis thaliana.
ESTs gb|Z34165 and gb|Z18788
come from this gene.
[Arabidopsis thaliana]


1e-07


1611 


AF100694


Mus musculus
Pontin52 mRNA,
complete cds


1e-28


4056454


(AC005990) Contains repeated
region with similarity to
gb|U43627 extensin (atExt1)
gene from Arabidopsis thaliana.
ESTs gb|Z34165 and gb|Z18788
come from this gene.
[Arabidopsis thaliana]


7e-08 


1612 


AF100694 


Mus musculus 
Pontin52 mRNA, 
complete cds 


1e-28


4056454


(AC005990) Contains repeated
region with similarity to
gb|U43627 extensin (atExt1)
gene from Arabidopsis thaliana.
ESTs gb|Z34165 and gb|Z18788
come from this gene.
[Arabidopsis thaliana]


2e-08


1613 


AF100694 


Mus musculus 
Pontin52 mRNA,
complete cds


1e-28


4056454


(AC005990) Contains repeated
region with similarity to
gb|U43627 extensin (atExt1)
gene from Arabidopsis thaliana.
ESTs gb|Z34165 and gb|Z18788
come from this gene.
[Arabidopsis thaliana]


6e-09



Nearest Neighbor (BlastN vs. Genbank)


Nearest Neighbor (BlastX vs. Non-Redundant Proteins)


SEQ
ID


ACCESSION 


DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 












(AC005990) Contains repeated




1614 


AF100694 


Mus musculus 
Pontin52 mRNA, 
complete cds 


1e-28


4056454 


region with similarity to 
gb|U43627 extensin (atExtl) 
gene from Arabidopsis thaliana. 
ESTs gb|Z34165 and gb|Z18788 
come from this gene. 
[Arabidopsis thaliana] 


5e-09 


1615 


AF100694


Mus musculus 
Pontin52 mRNA, 
complete cds 


1e-28


4056454


(AC005990) Contains repeated
region with similarity to
gb|U43627 extensin (atExt1)
gene from Arabidopsis thaliana.
ESTs gb|Z34165 and gb|Z18788
come from this gene.
[Arabidopsis thaliana]


4e-09


1616 


AF100694


Mus musculus
Pontin52 mRNA,
complete cds


1e-28


4056454


(AC005990) Contains repeated
region with similarity to
gb|U43627 extensin (atExt1)
gene from Arabidopsis thaliana.
ESTs gb|Z34165 and gb|Z18788
come from this gene.
[Arabidopsis thaliana]


7e-10


1617 


AF100694


Mus musculus
Pontin52 mRNA,
complete cds


1e-28


4056454


(AC005990) Contains repeated
region with similarity to
gb|U43627 extensin (atExt1)
gene from Arabidopsis thaliana.
ESTs gb|Z34165 and gb|Z18788
come from this gene.
[Arabidopsis thaliana]


6e-10 


1618


AF100694 


Mus musculus 
Pontin52 mRNA, 
complete cds 


1e-28


4056454


(AC005990) Contains repeated
region with similarity to
gb|U43627 extensin (atExt1)
gene from Arabidopsis thaliana.
ESTs gb|Z34165 and gb|Z18788
come from this gene.
[Arabidopsis thaliana]


5e-10


1619 


AF100694


Mus musculus
Pontin52 mRNA,
complete cds


1e-28


4056454


(AC005990) Contains repeated
region with similarity to
gb|U43627 extensin (atExt1)
gene from Arabidopsis thaliana.
ESTs gb|Z34165 and gb|Z18788
come from this gene.
[Arabidopsis thaliana]


4e-10
Nearest Neighbor (BlastN vs. Genbank)


Nearest Neighbor (BlastX vs. Non-Redundant Proteins)


SEQ
ID


ACCESSION


DESCRIPTION


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 












1620


AF100694


Mus musculus
Pontin52 mRNA,
complete cds


1e-28


4056454


(AC005990) Contains repeated
region with similarity to
gb|U43627 extensin (atExt1)
gene from Arabidopsis thaliana.
ESTs gb|Z34165 and gb|Z18788
come from this gene.
[Arabidopsis thaliana]


2e-10


1621 


AF100694


Mus musculus
Pontin52 mRNA,
complete cds


1e-28


4056454


(AC005990) Contains repeated
region with similarity to
gb|U43627 extensin (atExt1)
gene from Arabidopsis thaliana.
ESTs gb|Z34165 and gb|Z18788
come from this gene.
[Arabidopsis thaliana]


5e-11


1622 


AF100694


Mus musculus
Pontin52 mRNA,
complete cds


1e-28


4056454


(AC005990) Contains repeated
region with similarity to
gb|U43627 extensin (atExt1)
gene from Arabidopsis thaliana.
ESTs gb|Z34165 and gb|Z18788
come from this gene.
[Arabidopsis thaliana]


2e-12 


1623 


AF032896 


Petromyzon marinus 
polyadenylate binding
protein


1e-28


1082703


polyadenylate binding protein II
human


2e-27 


1624 


AF100694 


Mus musculus 
Pontin52 mRNA, 
complete cds 


9e-29 


118588 


DEHYDRIN DHN3 
>gi|100035|pir||S18139 dehydrin
DHN3 - garden pea >gi|20709 
(X63063) pea dehydrin DHN3 
[Pisum sativum] 


0.013 


1625 


AF100694 


Mus musculus 
Pontin52 mRNA, 
complete cds 


9e-29 


2133579 


spermatophorin Sp23 - yellow 
mealworm molitor) 


6e-04


1626 


AF100694 


Mus musculus 
Pontin52 mRNA, 
complete cds


9e-29 


3876465


(Z81071) predicted using
Genefinder; Similarity to
Human small nuclear
ribonucleoprotein E cDNA EST
yk375g7.5 comes from this
gene; cDNA EST yk435f5.3
comes from this gen...


9e-06 


1627 


AF100694


Mus musculus
Pontin52 mRNA,
complete cds


8e-29


4056454


(AC005990) Contains repeated
region with similarity to
gb|U43627 extensin (atExt1)
gene from Arabidopsis thaliana.
ESTs gb|Z34165 and gb|Z18788
come from this gene.
[Arabidopsis thaliana]


2e-06 



Nearest Neighbor (BlastN vs. Genbank)


Nearest Neighbor (BlastX vs. Non-Redundant Proteins)


SEQ 
ID 


ACCESSION 


DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION


P VALUE


1628


AF100694


Mus musculus
Pontin52 mRNA,
complete cds


4e-29


728883


ADP-RIBOSYLATION
FACTOR 3 fruit fly (Drosophila
melanogaster) >gi|507234
(L25063) ADP ribosylation
factor 3 [Drosophila
melanogaster]


0.016


1629 


AF100694


Mus musculus
Pontin52 mRNA,
complete cds


4e-29


544357


RNA-BINDING PROTEIN
FUS/TLS protein [human,
Peptide, 526 aa] [Homo sapiens]


2e-07 


1630 


AF100694 


Mus musculus 
Pontin52 mRNA,
complete cds


4e-29


4056454


(AC005990) Contains repeated
region with similarity to
gb|U43627 extensin (atExt1)
gene from Arabidopsis thaliana.
ESTs gb|Z34165 and gb|Z18788
come from this gene.
[Arabidopsis thaliana]


1e-08


1631


D43682


Human mRNA for
very-long-chain acyl-
CoA dehydrogenase
(VLCAD), complete
cds


4e-29


1168287


ACYL-COA
DEHYDROGENASE, VERY-
LONG-CHAIN SPECIFIC
PRECURSOR (VLCAD)
dehydrogenase precursor - rat
Acyl-CoA dehydrogenase
[Rattus norvegicus]


6e-37 


1632 


Y07660 


M.tuberculosis accBC
gene


4e-29


2113935


(Z95556) accDi
[Mycobacterium tuberculosis]


3e-47


1633 


X55367 


Human alpha-satellite 
DNA from clone 
pTRA-2.


1e-29


<NONE> 


<NONE> 


<NONE> 


1634 


L81866 


Homo sapiens 
(subclone l_f I from 
PI H54) DNA 
sequence 


1e-29


<NONE>


<NONE>


<NONE> 


1635 


S75940 


Alu repeats, clone
52H10) [human,
colonic mucosa,
Genomic, 943 nt]


1e-29


728831


!!!! ALU SUBFAMILY J
WARNING ENTRY


1e-07


1636 


AB001907 


Homo sapiens 
PACE4 gene, exon 13


1e-29


728831 


!!! ALU SUBFAMILY J 
WARNING ENTRY 


2e-09 


1637 


AF077003


Mus musculus SH3
domain-containing
adapter protein
mRNA, complete cds


5e-30


<NONE>


<NONE>


<NONE>







PCT/US00/18374 





Nearest Neighbor (BlastN vs. Genbank)


Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 


SEQ 
ID 


ACCESSION 


DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 












1638


AF100694


Mus musculus
Pontin52 mRNA,
complete cds


4e-30


4056454


(AC005990) Contains repeated
region with similarity to
gb|U43627 extensin (atExt1)
gene from Arabidopsis thaliana.
ESTs gb|Z34165 and gb|Z18788
come from this gene.
[Arabidopsis thaliana]


3e-10 


1639 


M27072 


Xenopus laevis 
poly(A)-binding
protein (ABP-EF)
mRNA, complete cds.


4e-30


1352709


POLYADENYLATE-
BINDING PROTEIN
polyadenylate-binding protein -
African clawed frog laevis]


5e-21


1640 


X58386 


B.taurus mRNA for
bovine vacuolar
ATPase subunit A


2e-30


2773154


(AF039573) abscisic acid- and
stress-inducible protein


4.3


1641 


Y07660 


M.tuberculosis accBC 
gene


1e-30


2113935 


(Z95556) accDi 
[Mycobacterium tuberculosis] 


4e-47 


1642 


AJ236940 


Sus scrofa mRNA for 
hypothetical protein 
(5'; clone 7C4)


4e-31


4102021


(AF007561) delta 6-desaturase
[Borago officinalis]


7.4 


1643 


AF039400


Homo sapiens
calcium-dependent
chloride channel-1
(hCLCA1) mRNA,
complete cds


2e-31


3721912


(AB017156) gob-5 [Mus
musculus]


7e-08


1644 


L77036 


Homo sapiens 
(subclone 5_d9 from 
PI HL9^ DNA 
sequence. 


1e-31


461663


BOMBYXIN B-2 HOMOLOG
PRECURSOR silkmoth
>gi|217385|gnl|PID|d1003528
(D13924) Samia bombyxin
homolog B-2 [Samia cynthia]


11 


1645 


X61971 


H.sapiens mRNA for
macropain subunit
delta


1e-31


296734


(X61971) macropain subunit
delta [Homo sapiens]


3e-06 


1646 


L00016 


Human mitochondrial
trnas and partial
proteins 4 & 5;
histidyl-, seryl-,
leucyl-trna genes;
urf4 and urf5
(partial).


5e-32


4056454


(AC005990) Contains repeated
region with similarity to
gb|U43627 extensin (atExt1)
gene from Arabidopsis thaliana.
ESTs gb|Z34165 and gb|Z18788
come from this gene.
[Arabidopsis thaliana] 


0.002 


1647 


M17887


Human acidic
ribosomal
phosphoprotein P2
mRNA, complete cds.


5e-32


4056454


(AC005990) Contains repeated
region with similarity to
gb|U43627 extensin (atExt1)
gene from Arabidopsis thaliana.
ESTs gb|Z34165 and gb|Z18788
come from this gene.
[Arabidopsis thaliana]


1e-05
Nearest Neighbor (BlastN vs. Genbank)


Nearest Neighbor (BlastX vs. Non-Redundant Proteins)


SEQ 
ID 


ACCESSION 


DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 




1659


U53446


Human mitogen-
responsive
phosphoprotein DOC-
2 mRNA, complete
cds.


6e-34


3395443


(AC004683) putative
ammonium transporter, 3' partial


4.7 


1660 


AF013988


Homo sapiens serine
protease mRNA,
complete cds


4e-34


2507226


PROTEIN-TYROSINE
PHOSPHATASE EPSILON
PRECURSOR (R-PTP-
EPSILON) >gi|1439603
(U62387) protein tyrosine
phosphatase-e [Mus musculus]


3.2


1661 


U53446 


Human mitogen-
responsive
phosphoprotein DOC-
2 mRNA, complete
cds.


2e-34


104757


LEP100 protein precursor -
chicken >gi|212254 gallus]


1.6 


1662 


AJ233632 


Homo sapiens 
endogenous retroviral 
sequence ERV-L pol 
gene, clone ERV-L 
Human6 


2e-34 


3860513 


(AJ233597) reverse 
transcriptase [Mus famulus] 


4e-10


1663 


AF086310 


Homo sapiens full 
length insert cDNA 
clone ZD51F08 


8e-35 


2947070


(AC002521) putative Ser/Thr
protein kinase [Arabidopsis
thaliana]


2.3


1664 


X17206


Human mRNA for
LLRep3


3e-35


730652


40S RIBOSOMAL PROTEIN
S2 (STRINGS OF PEARLS
PROTEIN)

>gi|1085158|pir||S50325
ribosomal protein S2 - fruit fly
(Drosophila melanogaster)
melanogaster] >gi|515972
(U01335) ribosomal protein S2 


2e-10 


1665 


AB011137 


Homo sapiens mRNA 
for KIAA0565
protein, complete cds


3e-35


3043654


(AB011137) KIAA0565 protein
[Homo sapiens]


2e-16


1666 


U62801 


Human protease M 
mRNA, complete cds


2e-35


3929231


(AF091247) potassium channel
[Rattus norvegicus]


1.0 


1667 


AF020760 


Homo sapiens serine 
protease (Omi) 
mRNA, complete cds 


1e-35


2738915


(AF020760) serine protease
[Homo sapiens]


9e-11
Nearest Neighbor (BlastN vs. Genbank)


Nearest Neighbor (BlastX vs. Non-Redundant Proteins)


SEQ 
ID 


ACCESSION 


DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 






1668


Z93943


Human DNA
sequence from
cosmid U235H3 on
chromosome X


8e-36


1196432


(M22333) unknown protein
[Homo sapiens]


3e-10


1669 


X06778 


Rabbit 18S rRNA 


7e-36 


118588 


DEHYDRIN DHN3
>gi|100035|pir||S18139 dehydrin
DHN3 - garden pea >gi|20709 
(X63063) pea dehydrin DHN3 
[Pisum sativum] 


0.011 


1670 


AB007962 


Homo sapiens 
mRNA, chromosome
1 specific transcript
KIAA0493


3e-36 


3329243 


(AE001350) hypothetical 
protein [Chlamydia trachomatis] 


3.1 


1671 


Z81014 


Human DNA 
sequence from
cosmid U65A4,
between markers
DXS366 and DXS87
on chromosome X


3e-36 


141103 


HYPOTHETICAL PROTEIN 
ORF-U37 mouse 


0.038 


1672 


Z81014 


Human DNA 
sequence from 
cosmid U65A4, 
between markers 
DXS366 and DXS87
on chromosome X


3e-36 


198651 


(M29325) ORF1 [Mus
musculus]


0.006 


1673


U49082 


Human transporter 
protein (g17) mRNA,
complete cds


3e-36


1840045


(U49082) transporter protein
[Homo sapiens]


2e-45


1674 


J03133 


Human transcription 
factor SP1 mRNA, 3'
end. 


3e-36 


477133 


HF-l regulatory element binding 
protein - rat 


2e-31 


1675 




Homo sapiens mRNA 
for KIAA0465 
protein, partial cds 


le-jo 




(AB007934) KIAA0465 protein
[Homo sapiens] 




1676 


M34857


Mouse Hox-2.5 
mRNA. 


9e-37 


106296 


homeotic protein Hox B9 - 
human (fragment) 


0.15 


1677 


L35657 


Homo sapiens 
(subclone H8 5_a10
from PI 35 H5C8)
DNA sequence.


9e-37


2072960 


(U93568) p40 [Homo sapiens] 


3e-05 


1678 


X80240


H.sapiens 
endogenous 
retrovirus HERV- 
KC4 DNA 


8e-37 


4185944 


(Y17833) env protein [Human
endogenous retrovirus K]


1e-15



Nearest Neighbor (BlastN vs. Genbank)


Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 


SEQ 
ID 


ACCESSION 


DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 






1679


Z93943


Human DNA
sequence from
cosmid U235H3 on
chromosome X


9e-38


106322


hypothetical protein (L1H 3'
region) - human 


4e-13 


1680 


X97303 


H.sapiens mRNA for
Ptg-12 protein


4e-38


466044


HYPOTHETICAL ZINC
FINGER PROTEIN ZK686.4
IN CHROMOSOME III
>gi|630730|pir||S44909 ZK686.4
protein - Caenorhabditis elegans
>gi|304346 (L17337) coded for
by C. elegans cDNAs
GenBank:M88869 and T01933;
putative [Caenorhabditis
elegans]


3e-07


1681 


Y08999 


H.sapiens mRNA for
Sop2p-like protein


3e-38 


3334339 


SOP2-LIKE PROTEIN 


5e-06 


1682 


Z62887


H.sapiens CpG DNA,
clone 74g6, forward
read cpg74g6.ft1a


2e-38


1245686


(U53181) F36D4.2 gene
product [Caenorhabditis
elegans]


0.19 


1683 


U35032 


Human endogenous 
retrovirus clone 
cS.ll, HERV-H 
multiply spliced 
subgenomic leader, 
protease and integrase 
region mRNA, partial 
cds 


1e-38


59977


(Z14310) tripartite fusion
transcript PLA2L [Human
endogenous retrovirus]


1e-06


1684 


D86974 


Human mRNA for 
KIAA0220 gene, 
partial cds 


1e-38


3337386 


(AC002544) Unknown gene 
product splice form-2 [Homo 
sapiens] 


8e-11


1685 


M31013 


Human nonmuscle 
myosin heavy chain 
(NMHC) mRNA, 3'
end.


1e-33


4H5748


(AB022023) nonmuscle myosin
heavy chain B


2e-H 


1686


AF006087


Homo sapiens Arp2/3 
protein complex 
subunit p20-Arc 
(ARC20) mRNA, 
complete cds 


4e-39 


<NONE> 


<NONE> 


<NONE> 


1687 


X58374


D.melanogaster cm
mRNA


1e-39


2655888


(AL009171) 62D9.a
[Drosophila melanogaster] 


4e-42 


1688 


D85815 


Human DNA for 
rhoHP1, complete cds


1e-39


134080


GTP-BINDING PROTEIN
TC10 ras-like protein [Homo
sapiens]


3e-26 
Nearest Neighbor (BlastN vs. Genbank)


Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 


SEQ 
ID 


ACCESSION 


DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 
















1689 


U49057 


Rattus norvegicus 
CTD-binding SR-like 
protein rA9 mRNA, 
complete cds 


4e-40 


1438534 


(U49057) rA9 [Rattus 
norvegicus]


5e-05


1690 


Y08999 


H.sapiens mRNA for
Sop2p-like protein


4e-40 


3334339 


SOP2-LIKE PROTEIN 


9e-08 


1691


AB002293


Human mRNA for
KIAA0295 gene,
partial cds


4e-40


2224531


(AB002293) KIAA0295 [Homo
sapiens]


1e-30


1692 


AF086222 


Homo sapiens full 
length insert cDNA 
clone ZC66E08 


1e-40


2829669


DOUBLE-STRANDED RNA-
SPECIFIC EDITASE 1
(DSRNA ADENOSINE
DEAMINASE) (RNA
EDITING ENZYME 1)
>gi|1707502|gnl|PID|e254627
(X99227) double-stranded RNA-
specific editase [Homo sapiens]
editase 1 hREDl-L [Homo 
sapiens] >gi|2039300 (U76421) 
dsRNA adenosine deaminase 
DRADA2b [Homo sapiens] 


0.61 


1693 


AF044127 


Homo sapiens 
peroxisomal short- 
chain alcohol 
dehydrogenase 
(SCAD-SRL) mRNA, 
complete cds 


1e-40


4105190


(AF044127) peroxisomal short-
chain alcohol dehydrogenase


2e-06


1694 


U36778 


Mus musculus Sil 
mRNA, complete cds


1e-40


88608


SIL protein - human >gi|338088
(M74558) SIL


6e-23


1695 


U36778 


Mus musculus Sil 
mRNA, complete cds


1e-40


88608


SIL protein - human >gi|338088
(M74558) SIL 


6e-23 


1696 


U36778 


Mus musculus Sil 
mRNA, complete cds


1e-40


88608


SIL protein - human >gi|338088
(M74558) SIL 


5e-23 


1697 


U36778 


Mus musculus Sil 
mRNA, complete cds 


1e-40


88608


SIL protein - human >gi|338088
(M74558) SIL 


5e-23 


1698 


AB018285


Homo sapiens mRNA
for KIAA0742
protein, partial cds


1e-40


3882205


(AB018285) KIAA0742 protein
[Homo sapiens]


6e-31
Nearest Neighbor (BlastN vs. Genbank)


Nearest Neighbor (BlastX vs. Non-Redundant Proteins)


SEQ 
ID 


ACCESSION 


DESCRIPTION 


P VALUE 


ACCESSION


DESCRIPTION 


P VALUE 












1699


X75927


M.musculus abc2
mRNA


1e-40


728773


ATP-BINDING CASSETTE
TRANSPORTER 1 ABC1 -
human >gi|495257 (X75926)
abc1 [Mus musculus]


3e-37 


1700 


AF038200 


Homo sapiens clone 
23954 mRNA 
sequence 


5e-41 


3211975 


(AF068195) putative 
glialblastoma cell differentiation- 
related protein [Homo sapiens] 


5e-14 


1701 


U20521 


Human estrogen 
sulfotransferase
(STE) gene, exon 8
and complete cds


4e-41


<NONE>


<NONE> 


<NONE> 


1702 


AF026548 


Homo sapiens 
branched chain alpha- 
ketoacid 

dehydrogenase kinase 
precursor, mRNA, 
nuclear gene 
encoding 
mitochondrial
protein, complete cds


2e-41


3182923


[3-METHYL-2-
OXOBUTANOATE
DEHYDROGENASE
(LIPOAMIDE)] KINASE
PRECURSOR alpha-ketoacid
dehydrogenase kinase precursor
[Homo sapiens]


2e-09


1703 


Y07660 


M.tuberculosis accBC 
gene


2e-41 


465847 


HYPOTHETICAL 66.5 KD 
PROTEIN F02A9.5 IN 
CHROMOSOME III 
>oiP8054' 1 lDirllS o 8313 
hypothetical protein F02A9.5 -
Caenorhabditis elegans
Genefinder: similar to Propionyl
CoA carboxylase beta chain; 
cDNA EST EMBL:M8901S 
comes from this gene; cDNA 
EST EMBL:D28069 comes 
from this gene; cDNA EST
EMBL:D28068 comes from this
gene; cDNA EST 


3e-38 


1704 


AG001237 


Homo sapiens
genomic DNA, 21q
region, clone:
9H11N46


1e-41


106322


hypothetical protein (L1H 3'
region) - human


5e-09


1705 


AB007934 


Homo sapiens mRNA 
for KIAA0465 
protein, partial cds 


1e-41


3413892


(AB007934) KIAA0465 protein
[Homo sapiens]


3e-12 


1706 


AF055029 


Homo sapiens clone 
24711 mRNA 
sequence 


5e-42


3250681


(AL024486) putative protein


2.2 
Nearest Neighbor (BlastN vs. Genbank)


Nearest Neighbor (BlastX vs. Non-Redundant Proteins)


SEQ 
ID 


ACCESSION 


DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 














1707 


Z49747 


O.cuniculus mRNA 
for phospholipase C 


5e-42 


130227 


PHOSPHATIDYLINOSITOL-
4,5-BISPHOSPHATE
PHOSPHODIESTERASE
DELTA 1 (PLC-DELTA-1)
(PHOSPHOLIPASE C-DELTA-
1) (PLC-III) >gi|163538
(MZOoJo) phospholipase C-III
[Bos taurus]


5e-36 


1708


M93651 


Human set gene, 
complete cds. 


2e-42 


<NONE> 


<NONE> 


<NONE> 


1709 


AJ236940 


Sus scrofa mRNA for 
hypothetical protein 
(5'; clone 7C4)


2e-42


2062403


(U79010) delta 6 desaturase
[Borago officinalis]


8.5 


1710 


J03634 


Human erythroid 
differentiation protein 
mRNA 


2e-42 


1708436 


INHIBIN BETA A CHAIN 
PRECURSOR 


2e-10 


1711 


AJ2->3777 


Mus musculus mRNA 
for striatin 


6e-43


2494917 


STRIATIN 

>gi|I49D7 /^l2nl|rUJ|ezj4i3o 




1712 


AF016411


Homo sapiens
potassium channel
subunit KCNA3.1B


2e-43


2708514


(AF016411) KCNA3.1B [Homo
sapiens]


3e-13


1713 


AC001443 


Homo sapiens 
(subclone 2^10 from 
BAC 2913 


1e-43


111814


hypothetical protein 3 - rat
>gi|56589


2e-06 


1714 


X82895


H.sapiens mRNA for
DLG2


6e-44


2497511


MAGUK P55 SUBFAMILY
MEMBER 2 (MPP2 PROTEIN)
(DISCS, LARGE HOMOLOG
2) 


6e-52 


1715 


U17077 


Human BENE 
mRNA, partial cds.


3e-44 


53912 


(X57960) ribosomal protein L7 
[Mus musculus] >gi|55489 


8e-30 


1716 




Homo sapiens mRNA 


2e-44 


<NONE> 


<NONE> 


<NONE>


1717 


J03634 


Human erythroid 
differentiation protein 
mRNA 


2e-44 


124279 


INHIBIN BETA A CHAIN
PRECURSOR PROTEIN)
(EDF) >gi|87936|pir||B24248
inhibin beta-A chain precursor -
human >gi|181947 (J03634)
erythroid differentiation protein
precursor [Homo sapiens]
sapiens]

>gi|226550|prf||1608260B
inhibin betaA [Homo sapiens]


0.73
Nearest Neighbor (BlastN vs. Genbank)


Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 


SEQ 
ID 


ACCESSION 


DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 
















1718 


AB014518 


Homo sapiens mRNA 
for KIAA0618 
protein, complete cds 


7e-45 


1911548 


(S80864) cytochrome c-like 
polypeptide sapiens] 


1.6 


1719 


X76808 


H.sapiens genomic
DNA clone d2


7e-45


868201


(U29380) similar to adenylate
cyclase [Caenorhabditis elegans]


2e-09


1720 


AB021288 


Homo sapiens mRNA 
for beta 2- 
microglobulin, 
complete cds 


2e-45


2465521


(U95995) RNA-dependent RNA
polymerase [Cryptosporidium
parvum]


0.15 


1721 


X63468 


H.sapiens mRNA for
transcription factor 
TFIIE alpha 


8e-46 


<NONE> 


<NONE> 


<NONE> 


1722 


AF019226 


Homo sapiens D2-2 
mRNA, 3'UTR 


7e-46 


<NONE>


<NONE> 


<NONE> 


1723 


D31764 


Human mRNA for 
KIAA0064 gene, 
complete cds 


2e-46 


3123050 


HYPOTHETICAL PROTEIN 
KIAA0064 


1e-15


1724 


K02774 


Human MHC class II
HLA-DR-beta-psi 
(DW4/DR4) 
pseudogene, exons 
3,4, 5,6, clones cos II- 
3301 and cosII^SOl. 


1e-46


4185946


(Y17834) gag protein [Human
endogenous retrovirus K]


2e-14


1725 


X92109 


H.sapiens hcalX gene


9e-47


2498185


BRIDE OF SEVENLESS
PROTEIN PRECURSOR
>gi|1079166|pir||A47550 bride
of sevenless precursor - fruit fly
(Drosophila virilis) >gi|290216
virilis]


1.4 


1726 


X93334 


H.sapiens 

mitochondrial DNA, 
complete genome


8e-47


128753


NADH-UBIQUINONE
OXIDOREDUCTASE CHAIN
4 >gi|86696|pir||A00435 NADH
dehydrogenase (ubiquinone)


4e-15


1727 


M85145


Human tumor
necrosis factor
receptor, 3' flank.


3e-47


<NONE> 


<NONE> 


<NONE> 


1728 


X80240


H.sapiens
endogenous
retrovirus HERV-
KC4 DNA


3e-47


4185944


(Y17833) env protein [Human
endogenous retrovirus K]


7e-18


1729 


Z63594 


H.sapiens CpG DNA,
clone 87f9, forward
read cpg87f9.ft1a


1e-47


3322743


(AE001222) T. pallidum
predicted coding region TP0454


2.4





Nearest Neighbor (BlastN vs. Genbank) 


Nearest Neighbor (BlastX vs. Non-Redundant Proteins)


SEQ
ID


ACCESSION 


DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 




1730


X62295


R.rattus mRNA for
vascular type-1
angiotensin II
receptor


A-AQ 


1209756 


(U43629) integral membrane 
protein [Beta vulgaris]


1e-07


1731 


M85145 


Human tumor 
necrosis factor 
receptor, 3' flank.


3e-48 


<NONE> 


<NONE> 


<NONE> 


1732 


AB020712


Homo sapiens mRNA
for KIAA0905
protein, complete cds 


4e-49 


4240299 


(AB020712) KIAA0905 protein 
[Homo sapiens] 


2e-20 


1733 


AB020712 


Homo sapiens mRNA 
for KIAA0905
protein, complete cds


3e-49


4240299


(AB020712) KIAA0905 protein
[Homo sapiens]


2e-20


1734 


X62295 


R.rattus mRNA for 
vascular type- 1 
angiotensin II 
receptor 


le-49 


1209756 


(U43629) integral membrane 
protein [Beta vulgaris]


7e-12 


1735 


AJ007509


Homo sapiens mRNA
for E1B-55kDa-
associated protein


1e-49


3319956


(AJ007509) E1B-55kDa-
associated protein


4e-24


1736 


X97303 


H.sapiens mRNA for 
Ptg-12 protein


1e-49


466044


HYPOTHETICAL ZINC
FINGER PROTEIN ZK686.4
IN CHROMOSOME III
>gi|630730|pir||S44909 ZK686.4
protein - Caenorhabditis elegans
>gi|304346 (L17337) coded for
by C. elegans cDNAs
GenBank:M88869 and T01933;
putative [Caenorhabditis
elegans]



8cO< ' 


1737 


AF038404 


Homo sapiens 
homolog of Nedd5 
(hNedd5) mRNA,
complete cds 


4e-50 


<NONE> 


<NONE> 


<NONE>


1738 


L43618 


Homo sapiens 
polycystic kidney 
disease (PKD1) gene,
exons 34-42 


4e-50 


903758 


(L43619) polycystic kidney 
disease 1 protein [Homo
sapiens] 


3e-< 


1739 


AF009424 


Homo sapiens clone 
22 mRNA. alternative 
splice variant alpha- 1, 
complete cds 


4e-50


227 U73 


(AF009426) clone 22 [Homo
sapiens] 


5e 
Nearest Neighbor (BlastN vs. Genbank)


Nearest Neighbor (BlastX vs. Non-Redundant Proteins)


SEQ 
ID 


ACCESSION 


DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 










1740


L77040


Homo sapiens
(subclone 8_c11 from
P1 H22) DNA
sequence.


2e-50


99758


monosaccharide transport protein
STP4 - Arabidopsis thaliana
>gi|16524 (X66857) sugar
transport protein [Arabidopsis
thaliana]


6.4 


1741 


L35657 


Homo sapiens 
(subclone H8 5_a10
from PI 35 H5C8)
DNA sequence.


2e-50


2072960 


(U93568) p40 [Homo sapiens] 


2e-05 


1742 


U30745 


raontu sapiens 1 vj / «i 

mRNA, partial cds 


1e-50


<NONE> 


<NONE> 


<NONE> 


1743 


D84514 


Bovine mRNA for 
p97, partial cds 


1e-50


3978527


(AF103728) structural
polyprotein [Sindbis virus]


9.9 


1744 


M22960


Human protective
protein mRNA,
complete cds.


1e-50


nj ao i 
1 Jluo 1 


LYSOSOMAL PROTECTIVE
PROTEIN PRECURSOR
(CATHEPSIN A)
human >gi|190233 (M22960)
protective protein precursor 


le- 1+ 


1745 


X86018


H.sapiens mRNA for
MUF1 protein


*e ->\J 


lUoZOiU 


muf1 protein - human
>gi|762953 (X86018) muf1
[Homo sapiens] 


ie~-- 1 


1746 


U03495 


Human transcription 
factor LSF-ID 
mRNA, complete cds. 


7e-51


2136296


transcription factor LSF - human
>gi|476099


1e-21


1747 


AB015344


Homo sapiens 
HRIHFB2157 


JC-J 1 




(AB015344) HRIHFB2157
[Homo sapiens]




1748


M93339 


Human zinc finger 
protein mRNA. 


4e-51


3024 no 


MYC-ASSOCIATED ZINC
FINGER PROTEIN sapiens] 


2e-06 


1749 


U71363 


Human zinc finger 
protein zfp6 (ZF6) 
mRNA, partial cds


4e-51


2689441


(AC003682) F18547_1 [Homo
sapiens]


2e-11


1750 


X56932 


H. sapiens mRNA for 
23 kD highly basic 
protein 


4e-51


730451


60S RIBOSOMAL PROTEIN
L13A (23 KD HIGHLY BASIC
PROTEIN)

>gi|345897|pir||S29539 basic
protein, 23K - human >gi|23691
(X56932) 23 kD highly basic 
protein [Homo sapiens] 


le-11 


1751 


Z79054 


H.sapiens flow-sorted
chromosome 6
HindIII fragment,
SC6pA21E11


2e-51 


<NONE> 


<NONE> 


<NONE> 



Nearest Neighbor (BlastN vs. Genbank)


Nearest Neighbor (BlastX vs. Non-Redundant Proteins)


SEQ
ID


ACCESSION 


DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 






1752


AF068245


Homo sapiens
BAF60b gene, partial
sequence


5e-52 


<NONE> 


<NONE> 


<NONE> 


1753 


AJ236932 


Sus scrofa mRNA for 
hypothetical protein 
(5'; clone 4B8) 


5e-52 


400927 


RIBONUCLEOPROTEIN 
RB97D ribonucleoprotein 
[Drosophila melanogaster]


4J ! 


1754 


AF003693 


Mus musculus 
scaffold protein Pbpl 
homolog mRNA, 
complete cds 


6e-53 


2197106 


(AF003693) scaffold protein 
Pbpl homolog [Mus musculus]


2e-54 


1755 


M27319 


Human calmodulin 
mRNA, complete cds. 


5e-53


115528


CALMODULIN
>gi|102408|pir||JC1309
calmodulin - Stylonychia lemnae
(SGC5) >gi|161195


0.002 


1756 


M74555 


Mouse house-keeping 
protein mRNA, 
complete cds. 


5e-53 


284775 


house-keeping protein - mouse

>»i|I93S7L 
P 1 i i — — r^; 


5e-30 


1757 


X92720 


H. sapiens mRNA for 
phosphoenol pyruvate 
carboxykinase


6e-54 


2135915 


phosphoenol pyruvate 
carboxykinase (GTP) (EC
4.1.1.32) precursor,
mitochondrial - human
carboxykinase (GTP) [Homo
sapiens]


6e-21 


1758 


AF007872 


Homo sapiens torsinB 
(DQ1) mRNA, partial 
cds 


2e-54 


2760121 


(AB002405) LAK-4p [Homo 
sapiens]


0.27 


1759 


U49507


Mus musculus
B6CBA Lisch7
mRNA, partial cds.


2e-54


1236083 


(U49507) Lisch7 [Mus 
musculus] 


3e-27 


1760 


Z73360 


Human DNA 
sequence from 
cosmid 92M18, 
BRCA2 gene region 
chromosome 13q12-
13. 


Ie-j3 


zj /U3/ I 


( Y 146^7) hydrophobin
[Pleurotus ostreatus]
>gi|2982620|gnl|PID|e1283986
(AJ225061) POH2 hydrophobin
[Pleurotus ostreatus]




1761 


U83702 


Human cytochrome c 
oxidase subunit VIa
gene, exon 3 and 
complete cds 


8e-56 


2982994 


(AE000682) hypothetical
protein [Aquifex aeolicus] 


7.0 


1762 


Y12781 


Homo sapiens mRNA 
for transducin (beta) 
like 1 protein


7e-56


3021409


(Y12781) transducin (beta) like
1 protein [Homo sapiens] 


7e-?9 
Nearest Neighbor (BlastN vs. Genbank)


Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 


SEQ 
ID 


ACCESSION 


DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 
















1763 


AB020673 


Homo sapiens mRNA 
for KIAA0866 
protein, complete cds


8e-57


2104553


(AF001548) Myosin heavy
chain (MHY11) (5' partial)
[Homo sapiens]


4e-04 


1764 


AJ236932 


Sus scrofa mRNA for 
hypothetical protein 
(5'; clone 4B8) 


3e-57 


400927 


RIBONUCLEOPROTEIN 
RB97D ribonucleoprotein
[Drosophila melanogaster] 


4.7 


1765 


L06900 


Human dystrophin 
gene, intron 1 
containing pseudo 
exon.


1e-58


4185129


(AC005724) unknown protein 
[Arabidopsis thaliana] thaliana) 


7.0 


1766 


X93334 


H.sapiens 

mitochondrial DNA, 
complete genome


9e-59 


1492050 


(U60315) MC107L [Molluscum 
contagiosum virus subtype 1]


0.17 


1767 


AF064856 


Rattus sp. 7acomp 
protein mRNA, 
complete cds 


3e-59


3169626


(AF064856) 7acomp protein
[Rattus sp.]


2e-31


1768 


AF081484 


Homo sapiens alpha- 
tubulin isoform I 
mRNA, complete cds 


2e-59 


32015 


(X06956) alpha-tubulin [Homo 
sapiens]


4e-22 


1769


X71427


Homo sapiens mRNA
for FUS-CHOP
protein fusion


1e-60


746557 


(U23523) histidine-rich 
[Caenorhabditis elegans] 


0.45 


1770 


AF013988 


Homo sapiens serine 
protease mRNA,
complete cds


1e-60


2564316 


(AB006622) No similarities to 
any reported proteins [Homo 
sapiens] 


0.26 


1771 


U25691 


Mus musculus 
lymphocyte specific 
helicase mRNA, 
complete cds 


7e-61


2137490


lymphocyte specific helicase -
mouse musculus]


3e-25 


1772 


X93334 


H.sapiens 

mitochondrial DNA, 
complete genome 


4e-61 


70656 


ubiquitin / ribosomal protein 
S27a - human extension protein,
HUBCEP80 [human, Peptide,
156 aa] ubiquitin extension
protein [Cavia porcellus]


9e-08


1773 


D38255


Homo sapiens mRNA
for CAB1, complete
cds 


4e-61 


2135214 


gene MLN 64 protein - human 


4e-23 


1774 


U25691


Mus musculus
lymphocyte specific
helicase mRNA,
complete cds


8e-62


2137490


lymphocyte specific helicase -
mouse musculus]


Se-26 
Nearest Neighbor (BlastN vs. Genbank)


Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 


SEQ 
ID


ACCESSION 


DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 
















1775 


M21731 


Human lipocortin-V 
mRNA, complete cds. 


6e-62 


3212603 


Human Annexin V With Proline 
Substitution By Thioproline 


2e-20 


1776 


AF021936 


Rattus norvegicus 
myotonic dystrophy 
kinase-related Cdc42- 
binding kinase 
MRCK-beta (MRCK- 
beta) mRNA,
complete cds


2e-62


2736153


(AF021936) myotonic
dystrophy kinase-related Cdc42-
binding kinase MRCK-beta 
[Rattus norvegicus] 


3e-27 


1777 


Y12059


H.sapiens HUNK1
mRNA


1e-62


3184498


(AC004798) R31546_1 [Homo
sapiens] 


3e-09 


1778 


L37368 


Human (clone E5.1) 
RNA-binding protein 
mRNA, complete cds.


6e-63 


477578 


sialidase - Actinomyces viscosus 
>gi|141852


7.8 


1779 


M27877


Figure 2. Nucleotide
and translated protein
sequences of HPF1, -
2, and -9.


5e-63


1731443


ZINC FINGER PROTEIN 83
(ZINC FINGER PROTEIN
HPF1) >gi|106023|pir||A32891
finger protein 1, placental - 
human 


3e-33 


1780 


AF095448 


Homo sapiens 
putative G protein- 
coupled receptor 


2e-63 


3116131 


(AL023288) hypothetical
protein 


4.6 


1781 


L19437


Human transaldolase
mRNA containing
transposable element,
complete cds


2e-63


1553119 


(U63159) transaldolase [Mus 
musculus] 


4e-13 


1782 


L41351 


Homo sapiens 
prostasin mRNA,
complete cds


1e-63


2833277


precursor - human >gi|862305
(L41351) prostasin [Homo
sapiens] >gi|1143194 (U33446)
prostasin [Homo sapiens] 


6e-14 


1783 


AF053470 


Homo sapiens 10kD
protein (BC10) 
mRNA, complete cds 


6e-64 


482237 


hypothetical protein K03H19 - 
Caenorhabditis elegans 


0.029 
Nearest Neighbor (BlastN vs. Genbank)


Nearest Neighbor (BlastX vs. Non-Redundant Proteins)


SEQ 
ID 


ACCESSION 


DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 
















1784 


D37791 


Mouse mRNA for 
beta-1,4-
galactosyltransferase


6e-64 


3880102 


(Z93390) similar to py ve zinc
finger; cDNA EST yk265b4.5 
comes from this gene; cDNA 
EST yk359g9.5 comes from this 
gene; cDNA EST yk319c2.5
comes from this gene 
[Caenorhabditis elegans] zinc 
finger, cDNA EST yk265b4.5 
comes from this gene; cDNA 
EST yk359g9.5 comes from this 
gene; cDNA EST yk319c2.5
comes from this gene 
[Caenorhabditis elegans] 


3e-16


1785 


AF015770 


Mus musculus radical 
fringe (radical-fringe)
mRNA, complete cds 


6e-64 


2204355 


(U94350) radical fringe 
precursor [Mus musculus] 


1e-36


1786 


Z79054 


H.sapiens flow-sorted 
chromosome 6 
HindIII fragment,
SC6pA21Eil 


2e-64 


<NONE> 


<NONE> 


<NONE> 


1787 


M83094 


Homo sapiens 
cytosolic selenium- 
dependent glutathione 
peroxidase gene, 
complete cds, and
rhoh12 gene, 3' end.


1e-64


2447063 


(U42580) A565R [Paramecium 
bursaria Chlorella virus 1]


8.8 


1788 


Y10211 


H.sapiens LAG-3
gene, promoter region 


7e-65 


1944540 


(X14112) tegument protein
[human herpesvirus 1] 


2.3 


1789 


M19045


Human lysozyme
mRNA, complete cds.


2e-65 


<NONE> 


<NONE> 


<NONE> 


1790 


U01882 


Homo sapiens SS- 
A/Ro autoantigen 52 
kda component gene, 
complete cds 


2e-65


585401 


LIPASE MODULATOR 
PRECURSOR (LIPASE 
HELPER PROTEIN) 
>gi|480045|pir||S36249 lipB 
protein - Pseudomonas glumae 
>gi|49207 (X70354) helper 
protein 


4.2 


1791 


AF069517 


Homo sapiens RNA 
binding protein DEF- 
3 mRNA, complete
cds 


2e-65 


3212101 


(AF069517) RNA binding 
protein DEF-3 [Homo sapiens] 


1e-25
Nearest Neighbor (BlastN vs. Genbank)


Nearest Neighbor (BlastX vs. Non-Redundant Proteins)


SEQ
ID


ACCESSION


DESCRIPTION


P VALUE


ACCESSION


DESCRIPTION


P VALUE


1792


J moans


Homo sapiens jerky
gene product
homolog mRNA,
complete cds


2e-65 


2314829 


(AF004715) jerky gene product
homolog [Homo sapiens]


2e-45 


1793 


X59652


C.longicaudatus hprt
mRNA for
hypoxanthine


3e-66


631625 


hypoxanthine (guanine)
phosphoribosyltransferase - long
tailed hamster
phosphoribosyltransferase
[Cricetulus longicaudatus]


6e-54 


1794 


U94350


Mus musculus radical
fringe precursor
mRNA, complete cds


3e-67


2204355 


(U94350) radical fringe 
precursor [Mus musculus] 


2e-33 


1795 


AF015811


Mus musculus
putative
lysophosphatidic acid
acyltransferase
mRNA, complete cds


3e-68


2317725


(AF015811) putative
lysophosphatidic acid
acyltransferase [Mus musculus]


7e-51


1796 


J03137 


Cow 

phosphoinositide-
specific 

phospholipase C 


3e-69 


226908 


phospholipase C 154 [Bos 
taurus] 


3e-25


1797 


AF044574 


Rattus norvegicus 
putative peroxisomal 
2,4-dienoyl-CoA
reductase (DCR-
AKL) mRNA,
complete cds


1e-69


4105269 


(AF044574) putative 
peroxisomal 2,4-dienoyl-CoA 




1798


AF015811


Mus musculus
putative
lysophosphatidic acid
acyltransferase
mRNA, complete cds


4e-70


2317725


(AF015811) putative
lysophosphatidic acid
acyltransferase [Mus musculus]


3e-19


1799


X65157


M.musculus mRNA
for desmoyokin,
partial


5e-74


109781


desmoyokin - mouse (fragment)
>gi|50675


9e-37


1800


Z97207


Mus musculus mRNA
for B-IND1 protein


2e-74


2231019


(Z97207) B-IND1 protein [Mus
musculus]


6e-21


1801


U27196


Gallus gallus zinc
finger protein (Ftf-1)
mRNA, complete cds.


6e-75


984814


(U27196) zinc finger protein
[Gallus gallus] gallus]


2e-44
Nearest Neighbor (BlastN vs. Genbank)


Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 


SEQ 
ID 


ACCESSION 


DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 












7okbWb-kE^Al l tuMogr 




1802 


Y15054


Rattus norvegicus
mRNA for 70 kDa
tumor specific
antigen, partial


3e-77


3123027


>gi|2505957|gnl|PID|e353992
(Y15054) 70 kD tumor-specific
antigen [Rattus norvegicus]


4e-42


1803 


X65157 


M.musculus mRNA
for desmoyokin,
partial


3e-79


109781


desmoyokin - mouse (fragment)
>gi|50675 


9e-33 


1804 


U50736 


Rattus norvegicus 
cardiac adriamycin
responsive protein
mRNA, complete cds


2e-84


1362781


cytokine inducible nuclear
protein C193 - human
>gi|793841 (X83703) nuclear
protein [Homo sapiens]


7e-30


1805 


AF072865 


Rattus norvegicus 
thioredoxin reductase
(TrxR2) mRNA,
nuclear gene
encoding
mitochondrial
protein, complete cds


2e-84


3757888


(AF072865) thioredoxin
reductase [Rattus norvegicus]


6e-43


1806 


AF044574 


Rattus norvegicus
putative peroxisomal
2,4-dienoyl-CoA
reductase (DCR-
AKL) mRNA,
complete cds


6e-85


4105269


(AF044574) putative
peroxisomal 2,4-dienoyl-CoA
reductase [Rattus norvegicus]


1e-41


1807 


U19181 


Rattus norvegicus 
Rabin3 mRNA, 
complete cds. 


2e-87 


624225 


(U19181) Rabin3 [Rattus
norvegicus]


2e-41


1808 


U40342


Mus musculus ninein
mRNA, complete cds.




111 7f}£€ 


(U40342) ninein [Mus 
musculus] 


iC-jo 


1809 


X67S77 


R. norvegicus mRNA 
for cytosolic
resiniferatoxin-
binding protein


4e-92 


136077 


TROPOMYOSIN BETA 3, 
FIBROBLAST chicken 
>2il5 15694 (M^308^ 
tropomyosin [Gallus gallus]


0.56 


1810


AF044574


Rattus norvegicus
putative peroxisomal
2,4-dienoyl-CoA
reductase (DCR-
AKL) mRNA,
complete cds


5e-93


4105269


(AF044574) putative
peroxisomal 2,4-dienoyl-CoA
reductase [Rattus norvegicus]


leoO 


1811


AF035527 


Mus musculus EHF 
(Ehf) mRNA, 
complete cds 


2e-95


3133930


(AF035527) EHF [Mus
musculus]


2e-47 



Nearest Neighbor (BlastN vs. Genbank)


Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 


SEQ 
ID 


ACCESSION 


DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 
















1812 


AB016930


Cricetulus griseus
mRNA for
Phosphatidylglycerop
hosphate synthase,
complete cds


6e-96


4159682


(AB016930)
Phosphatidylglycerophosphate
synthase [Cricetulus griseus]


7e-41


1813 


AB005549 


Rattus norvegicus 
mRNA for atypical 
PKC specific binding 
protein, complete cds 


7e-97 


3868778 


(AB005549) atypical PKC 
specific binding protein [Rattus 
norvegicus]


3e-41 


1814 


X90849 


G.gallus PB1 gene


2e-97


2134381


polybromo 1 protein - chicken
chicken >gi|951231 (X90849)
polybromo 1 protein [Gallus
gallus]


1e-34


1815 


S79873 


h-lamp-2=lysosome-
associated membrane 
protein-2 protein-2b 
(LAMP2) mRNA, 
alternatively spliced 
form h-lamp-2b, 
complete cds. 


3e-98 


<NONE> 


<NONE> 


<NONE> 


1816 


U67203 


Mus musculus ACF7 
neural isoform 1 
(mACF7) mRNA,
partial cds


2e-98


1675224 


(U67204) ACF7 neural isoform 
2 [Mus musculus] 


9e-39 


1817 


L14684


Rattus norvegicus
nuclear-encoded
mitochondrial
elongation factor G
mRNA, complete cds.


e-100


585084


ELONGATION FACTOR G,
MITOCHONDRIAL
PRECURSOR (MEF-G)
>gi|543383|pir||S40780
translation elongation factor G,
mitochondrial - rat >gi|310102


2e-30 


1818 


X84692


M. musculus Spnr
mRNA for RNA
binding protein


e-133


1363238


Spermatid perinuclear RNA-
binding protein Spnr - mouse
>gi|673454 (X84692) spermatid
perinuclear RNA binding
protein [Mus musculus]


5e-35 


1819 


U50736 


Rattus norvegicus 
cardiac adriamycin 
responsive protein 
mRNA, complete cds


e-113


1362781


cytokine inducible nuclear
protein C193 - human
>gi|793841 (X83703) nuclear
protein [Homo sapiens]


2e-36 


1820 


S66855 


HoxB9=Hox-2.5
[mice, embryos,
mRNA Partial, 786
nt]


e-107


1708355


HOMEOBOX PROTEIN HOX-
B9 (HOX-2.5)


Se-37 



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Nearest 


Neighbor (BlastN vs. Genbank)


Nearest Neighbor (BlastX vs. Non-Redundant Proteins)


SEQ
ID


ACCESSION


DESCRIPTION


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 














1821 


S66855 


HoxB9=Hox-2.5
[mice, embryos,
mRNA Partial, 786
nt]


e-108


1708355


HOMEOBOX PROTEIN HOX- 
B9 (HOX-2.5) 


4e-37 


1822 


U92072 


Rattus norvegicus m-
tomosyn mRNA,
complete cds 


e-102 


3790389 


(U92072) m-tomosyn [Rattus 
norvegicus] 


2e-38 


1823 


D17577


Mouse mRNA for
kinesin-like protein
(Kif1b), complete cds


e-129


2497524


KINESIN-LIKE PROTEIN
KIF1B mouse
>gi|407339|gnl|PID|d1005029
(D17577) Kif1b [Mus
musculus]


2e-39 


1824 


AF062484 


Mus musculus SDP8 
mRNA, complete cds 


e-122


3126981


(AF062484) SDP8 [Mus
musculus] 


5e-40 


1825 


X73683 


R.norvegicus mRNA 
for histone H3.3


e-109 


122075 


(H3.3Q) histone H3.3 - fruit fly
(Drosophila melanogaster)
histone H3.3B - chicken
>gipll9023ipir||S612lS histone
H3.3 - fruit fly (Drosophila
hydei) 1-136) [Oryctolagus
cuniculus] >si|S046 (X53822)
niMuiie rxj. gene produce
[Drosophila melanogaster]
>gi|5tl98 gallus] >gi|161190
(M17876) histone H3 [Spisula
solidissima] >£i|- 1 loD^
(M11393) histone 3.3 [Gallus
gallus] >gi|306S48 (M11354)
H3.3 histone [Homo sapiens]
melanogaster] >gi|96303 1

variant [Drosophila
melanogaster] musculus]


2e-40 


1826 


U67203 


Mus musculus ACF7 
neural isoform 1 
(mACF7) mRNA, 
partial cds 


e-102


1675224


(U67204) ACF7 neural isoform
2 [Mus musculus]


2e-40 


1827 


D17577


Mouse mRNA for
kinesin-like protein
(Kif1b), complete cds


e-131


2497524


KINESIN-LIKE PROTEIN
KIF1B mouse
>gi|407339|gnl|PID|d1005029
(D17577) Kif1b [Mus
musculus]


7e-42
Nearest Neighbor (BlastN vs. Genbank)


Nearest Neighbor (BlastX vs. Non-Redundant Proteins)


SEQ 
ID 


ACCESSION 


DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 
















1828 


AB016930 


Cricetulus griseus
mRNA for
Phosphatidylglycerop
hosphate synthase,
complete cds


e-131


4159682


Phosphatidylglycerophosphate
synthase [Cricetulus griseus]


3e-43


1829 


U09874 


Mus musculus SKD3
mRNA, complete cds


e-122 




SKD3 PROTEIN SKD3 (Mus 




1830 


X99145 


C.familiaris mRNA
for C3VS protein


e-110


1429314


(X99145) overexpressed in
thyroid tissue after TSH
stimulation [Canis familiaris]


2e-49 


1831 


X99836 


P.walti mRNA for 
mp associated protein 


fax 




(X99836) RAP55 [Pleurodeles

4.1*11 

waltl]


2e-50 


1832 


AF077003


Mus musculus SH3
domain-containing
adapter protein
mRNA, complete cds


e-121


3550240


(AF077003) SH3 domain-
containing adapter protein; 
CD2AP 


3e-51 


1833


AF060246


Mus musculus strain
C57Bl/6 zinc finger
protein 106 (Zfp106)
mRNA, H3a-a allele,
complete cds


e-118


3372657


(AF060246) zinc finger protein
106 [Mus musculus] 


Ieo2 


1834 


Z14030


R.norvegicus mRNA
for TRAP-complex
gamma subunit.


e-120


1174453


TRANSLOCON-
ASSOCIATED PROTEIN,
GAMMA SUBUNIT (TRAP-
GAMMA) (SIGNAL
SEQUENCE RECEPTOR
GAMMA SUBUNIT) (SSR-
GAMMA)
>gi|423185|pir||S33294
translocon-associated protein
gamma chain - rat norvegicus]


7e-54 


1835 


AF077003


Mus musculus SH3
domain-containing
adapter protein
mRNA, complete cds


e-132


3550240


(AF077003) SH3 domain-
containing adapter protein; 
CD2AP 


5e-54 


1836 


L20427


Rattus norvegicus
dihydroxypolyprenylb
enzoate
methyltransferase
mRNA, complete cds


e-116


457372


(L20427)
dihydroxypolyprenylbenzoate
methyltransferase
dihydroxypolyprenylbenzoate
methyltransferase [Rattus
norvegicus]


4e-56
Nearest Neighbor (BlastN vs. Genbank)


Nearest Neighbor (BlastX vs. Non-Redundant Proteins)


SEQ
ID


ACCESSION


DESCRIPTION


P VALUE


ACCESSION


DESCRIPTION


P VALUE


1837 


XS0169 


M.musculus mRNA 
for 200 kD protein 


e-122 


1717793 


PROTEIN TSG24 (MEIOTIC
CHECK POINT
REGULATOR)
>gi|1083553|pir||A55117 tsg24


2e-56 


1838 


AF080568 


Rattus norvegicus 
CTP:phosphoethanol
amine
cytidylyltransferase
mRNA, complete cds


e-119


3396102


(AF080568)
CTP:phosphoethanolamine
cytidylyltransferase


6e-58 


1839 


X99145 


C.familiaris mRNA 
for C3VS protein 


e-121 


1429314 


(X99145) overexpressed in
thyroid tissue after TSH
stimulation [Canis familiaris]


2e-53 


1840 


AF019075


Pan troglodytes breast
and ovarian cancer
susceptibility
(BRCA1) gene,
partial cds


e-145


2218154


(AF005068) breast and ovarian
cancer susceptibility protein
splice variant [Homo sapiens]


1e-58


1841 


U55042 


Bos taurus myosin X,
complete cds 


e-122 


1755049 


(U55042) myosin X [Bos 
taurus] 


1e-61


1842


AJ007780


Mus musculus mRNA
for poly(ADP-ribose)
polymerase-2


e-119


3283975


(AF072521) poly-(ADPribosyl)-
transferase homolog PARP


4e-62


1843


AF072865


Rattus norvegicus
thioredoxin reductase
(TrxR2) mRNA,
nuclear gene
encoding
mitochondrial
protein, complete cds


e-105


3757888


(AF072865) thioredoxin
reductase [Rattus norvegicus]


3e-62


1844 


U55042 


Bos taurus myosin X, 
complete cds 


e-121 


1755049 


(U55042) myosin X [Bos
taurus]


1e-62


1845 


X61506


Mouse E46 mRNA 
for E46 protein 


e-139 


114909 


BRAIN PROTEIN E46 


9e-67 


1846 


D90335


Bovine mRNA for
GTP-binding protein
alpha-subunit


e-143


585174


GUANINE NUCLEOTIDE-
BINDING PROTEIN, ALPHA-
14 SUBUNIT (GL1)
>gi|108711|pir||A40891 GTP-
binding protein GL1 alpha chain
bovine protein, alpha-subunit
[Bos taurus]


2e-69 


1847 


U49507


Mus musculus
B6CBA Lisch7
mRNA, partial cds.


e-110


2121326


(AC00212S) Lisch7 [Homo
sapiens]


2e-74 
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Table 4 





Nearest Neighbor (BlastN vs. Genbank)


Nearest Neighbor (BlastX vs. Non-Redundant Proteins)


SEQ
ID


ACCESSION


DESCRIPTION


P VALUE


ACCESSION


DESCRIPTION


P VALUE




1


<NONE>


<NONE>


<NONE>


<NONE>


<NONE>


<NONE>


2


<NONE>


<NONE>


<NONE>


<NONE>


<NONE>


<NONE>


3


<NONE>


<NONE>


<NONE>


<NONE>


<NONE>


<NONE>


4


<NONE>


<NONE>


<NONE>


<NONE>


<NONE>


<NONE>


5


<NONE>


<NONE>


<NONE>


<NONE>


<NONE>


<NONE>


6


<NONE>


<NONE>


<NONE>


<NONE>


<NONE>


<NONE>


7


<NONE>


<NONE>


<NONE>


<NONE>


<NONE>


<NONE>


8


<NONE>


<NONE>


<NONE>


<NONE>


<NONE>


<NONE>


9


<NONE>


<NONE>


<NONE>


<NONE>


<NONE>


<NONE>


10


<NONE>


<NONE>


<NONE>


<NONE>


<NONE>


<NONE>


11


<NONE>


<NONE>


<NONE>


<NONE>


<NONE>


<NONE>


12


<NONE>


<NONE>


<NONE>


<NONE>


<NONE>


<NONE>


13


<NONE>


<NONE>


<NONE>


<NONE>


<NONE>


<NONE>


14


<NONE>


<NONE>


<NONE>


<NONE>


<NONE>


<NONE>


15


<NONE>


<NONE>


<NONE>


<NONE>


<NONE>


<NONE>


16


<NONE>


<NONE>


<NONE>


<NONE>


<NONE>


<NONE>


17


<NONE>


<NONE>


<NONE>


<NONE>


<NONE>


<NONE>


18


<NONE>


<NONE>


<NONE>


<NONE>


<NONE>


<NONE>


19


<NONE>


<NONE>


<NONE>


<NONE>


<NONE>


<NONE>


20


<NONE>


<NONE>


<NONE>


<NONE>


<NONE>


<NONE>


21


<NONE>


<NONE>


<NONE>


<NONE>


<NONE>


<NONE>


22


<NONE>


<NONE>


<NONE>


<NONE>


<NONE>


<NONE>


23


<NONE>


<NONE>


<NONE>


1079469


tMDC I protein - crab-eating
macaque


9.3


24


<NONE>


<NONE>


<NONE>


3043656


(AB011138) KIAA0566 protein
[Homo sapiens]


9.3


25


<NONE>


<NONE>


<NONE>


112175


potassium channel protein RK5 -
rat protein [Rattus norvegicus]


8.6


26


<NONE>


<NONE>


<NONE>


3769624


(AF091565) olfactory receptor
[Rattus norvegicus]


7.2


27


<NONE>


<NONE>


<NONE>


3876443


(Z8I5i7) F: SB 1.0
[Caenorhabditis elegans]


7.1


28


<NONE>


<NONE>


<NONE>


2224464


(AB001684) ORF249 [Chlorella
vulgaris]


6.9


29


<NONE>


<NONE>


<NONE>


1 ^ 1 0707


(U67940) ORFvegiOb; random
cDNA sequence [Dictyostelium
discoideum]


6.7


30


<NONE>


<NONE>


<NONE>


22749 L


protein kinase C II [Xenopus
laevis]


6.7


31


<NONE>


<NONE>


<NONE>


630575


C50C3.4 protein -
Caenorhabditis elegans


6.0


32


<NONE>


<NONE>


<NONE>


137290


JSltt>Mi0ffclNtoRS\*2 '
clover necrotic mosaic virus
>gi|61466 (X0S02H ORF for 35
kDa polypeptide (AA 1-317)
[Red clover necrotic mosaic
virus]


6.0



WO 01/02568 
Nearest Neighbor (BlastN vs. Genbank)


Nearest Neighbor (BlastX vs. Non-Redundant Proteins)


SEQ
ID


ACCESSION


DESCRIPTION


P VALUE


ACCESSION


DESCRIPTION
(X167I1) pid:s30041 [Homo


P VALUE


33 

34 
35 


<NONE>

<NONE> 
<NONE> 


<NONE> 

<NONE> 
<NONE> 


<NONE> 

<NONE> 
<NONE> 


30041
2493585


sapiens] 

CELL DIVISION PROTEIN 
FTSW 


5.7 


36 


<NONE> 


<NONE> 


<NONE> 


1001450
3182918


(D63999) hypothetical protein
NITROGEN REGULATORY
PROTEIN AREA


5.7
5.2 


37 


<NONE> 


<NONE> 


<NONE> 


1 I400U 


MITOCHONDRIAL
RIBOSOMAL PROTEIN S5
Emericella nidulans
mitochondrion (SGC3)
>gi|12709 nidulans] >gi|472822
(J01390) unknown protein


4.3 


38 


<NONE> 


<NONE> 


<NONE> 


3979943 


(AL0W393) predicted using
Genefinder; similar to WD
domain, G-beta repeat; cDNA
EST yk362f7.5 comes from this
gene; cDNA EST yk362f7.3
comes from this gene
[Caenorhabditis elegans]


4.0 


39 


<NONE> 


<NONE> 


<NONE>


950203


(U31329) polyketide synthase
[Aspergillus terreus]


3.3 


40 


<NONE> 


<NONE> 


<NONE> 


3560232 


(AL031530) hypothetical zinc
finger protein
[Schizosaccharomyces pombe]


3.0 


41 


<NONE> 


<NONE> 


<NONE>


730071


AXONEME-ASSOCIATED
PROTEIN MST 101(1) product
[Drosophila hydei]


2.6 


42 


<NONE> 


<NONE> 


<NONE>


2506641


HYPOTHETICAL 21.7 KD
PROTEIN IN INTE-PIN
INTERGENIC REGION
>gi|1787402 (AE000214) orf,
hypothetical protein
[Escherichia coli]


2.5 


43 


<NONE> 


<NONE> 




3511232


(AF071556) anthranilate
dioxygenase large subunit


2.4 


44 


<NONE> 


<NONE> 


<NONE> 


1150900


(U43139) envelope glycoprotein
gp120 [Human
immunodeficiency virus type 1]


1.9 


45 


<NONE> 


<NONE> 


<NONE>


3876099


(Z75536) similar to dynein
heavy chain; cDNA EST
EMBL:D27549 comes from this
gene; cDNA EST
EMBL:D34859 comes from this
gene [Caenorhabditis elegans]


1.4
Nearest Neighbor (BlastN vs. Genbank)


Nearest Neighbor (BlastX vs. Non-Redundant Proteins)




DESCRIPTION 



(AL032647) predicted using



COLANIC ACID CAPSULAR
BIOSYNTHESIS
ACTIVATION PROTEIN A
>gi|95605|pir||S17701 rcsA
protein 



P VALUE 



(U61380) germination protein
[Bacillus megaterium]



kPUiHhlltAL lf.4iO>
PROTEIN IN FOX1-KEX1
INTERGENIC REGION
>gi|2132566|pir||S64222
probable membrane protein
YGL204c - yeast
(Saccharomyces cerevisiae)
>gi|1322838|gnl|PID|e243803
(Z72726) ORF YGL204c
[Saccharomyces cerevisiae]



1.4 



U 



(AJ006514) prolipoprotein
diacylglyceryl transferase
[Vibrio cholerae]



(AL021428) hypothetical
protein Rv0064



(U17986) GABA/noradrenaline 
bmo 



transporter [Homo sapiens] 



(AF076184) cytosolic sorting 
protein PACS-1b [Rattus
norvegicus]



0.84 



0.31



0.27 



0.13 



0.12 



t"U!)^JU4j coded lor by C 

elegans cDNA yk92b4.5: coded 
'for by C. elegans cDNA 
Iyk73ai.5; coded for by C 
elegans cDNA ykl02e°.5; 
coded for by C. elegans cDNA 
yk7Ic8,5; coded for by C. 
elegans cDNA yk66dll.5; 
coded for by C elegans cDNA 
yk66c3.. 



Bkm-like sex-determining 
region hypothetical protein
CS314 - fruit fly (Drosophila
melanogaster)



0.074



0.003 



Ras inhibitor (clone JC265) -
human sapiens] 



0.002 
Nearest Neighbor (BlastN vs. Genbank)


Nearest Neighbor (BlastX vs. Non-Redundant Proteins)


SEQ 
ID 


ACCESSION 


DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 












57


<NONE>


<NONE>


<NONE>


103076


Bkm-like sex determining
region hypothetical protein
CS314 - fruit fly (Drosophila
melanogaster)


2e-04


58 


<NONE> 


<NONE> 


<NONE> 


2702370 


(AF038604) contains similarity 
to Drosophila ovarian tumor 
locus protein (GB:X13693) 
[Caenorhabditis elegans] 


6e-05 


59 


<NONE> 


<NONE> 


<NONE> 


3859713 


(AL033501) phox domain 
protein [Candida albicans] 


3e-05 


60 


<NONE> 


<NONE> 


<NONE> 


2088839 


(AF003386) F59E115 gene
product [Caenorhabditis 
elegans] 


2e-08 


61 


<NONE> 


<NONE> 


<NONE> 


121059 


GC-RICH SEQUENCE DNA-
BINDING FACTOR GCF -
human >gi|179412 (M29204)
DNA-binding factor [Homo 
sapiens] 


4e-09 


62 


<NONE> 


<NONE> 


<NONE> 


3875246 


(7^1490) similar to WD
domain, G-beta repeats (2 
domains); cDNA EST 
EMBL:T00482 comes from this 
gene; cDNA EST 
EMBL:T00923 comes from this 
gene; cDNA EST yk449d4.3 
comes from this gene; cDNA 
EST yk449d4.5 comes from this 
gen... 


9e-24


63 


<NONE> 


<NONE> 


<NONE> 


1465834 


(U64857) No definition line 
found [Caenorhabditis elegans] 


9e-28 


64 


<NONE> 


<NONE> 


<NONE> 


3327136 


(AB014561) KIAA0661 protein
[Homo sapiens]


1e-29


65 


<NONE> 


<NONE> 


<NONE> 


3880433 


(Z66521) similar to 
mitochondrial RNA splicing
MSR4 like protein; cDNA EST 
EMBL:C09217 comes from this 
gene [Caenorhabditis elegans] 


8e-31 


66 


D42133 


Rat annexin V gene, 
exon7 and exon8 


5.0


<NONE> 


<NONE> 


<NONE> 


67 


L35679


Homo sapiens
(subclone H8 2_d11
from P1 35H5C8)
DNA sequence.


5.0


1086902


[UAlim coded for by C.
elegans cDNA yk79g8.5; coded
for by C. elegans cDNA
cm10c5; coded for by C. elegans
cDNA yk79g3.3; similar to
leucine-rich repeats found in
many proteins [Caenorhabditis
elegans]


6.6 



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Nearest Neighbor (BlastN vs. Genbank) 


Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 


SEQ
ID


ACCESSION 


DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 




68


U90184


HIV-1 strain BX220
from USA, envelope
glycoprotein C2V3
region (env) gene,
partial cds


5.0 


1297070 


(Z71986) convicilin precursor 
[Vicia narbonensis] 


6.6 


69 


U61465


Human myosin VIIa
(MYO7A) gene,
exon 37


10


231122S


(AE000535) L-lactate permease
(lctP) [Helicobacter pylori


J.U 


70 


AF013717 


Homo sapiens 
periplakin (PPL) 
mRNA, partial cds 


5.0


3719238


(AF064869) brain-enriched
guanylate kinase-associated
protein 2; BEGA2 [Rattus
norvegicus] 


3.8 


71 


X5S245 


Soybean mRNA for 
HMG-1 like protein


5.0


2995363 


(AL022245) biotin synthase 


0.99 


72 


AF102425 


Frasera paniculata
tRNA-Leu (trnL)
gene, intron,
chloroplast sequence 


4.9 


3522958 


(AC0044U) putative
pectinesterase [Arabidopsis
thaliana]


0.4 


73 


X82817 


H. sapiens 

PTP1C/HCP-variant
gene 


4.9 


3875514 


(Z81494) cDNA EST
EMBL:D27474 comes from this
gene; cDNA EST
EMBL:D27473 comes from this
gene; cDNA EST
EMBL:T00471 comes from this
gene; cDNA EST
EMBL:D34192 comes from this
gene; cDNA EST
EMBL:D37241 comes from this
gene...


2.8 


74 


U04827 


Mus musculus brain
fatty acid-binding 
protein 


4.9 


3676132


(AL031765) 1-
evidence=predicted by content;
1-method=genefinder;084; 1-
method_score=31.96; 1-
evidence_end; 2-
evidence=predicted by match; 2-
match_accession=SPTREMBL:
393319; 2-
match_description=HYPOTHE
TICAL PROTEIN C33A11.2.;...


2e-09 


75 


AF038859


Neospora hughesi
strain NE1 internal
transcribed spacer 1,
complete sequence


4.8 


<NONE> 


<NONE> 


<NONE>
Nearest Neighbor (BlastN vs. Genbank)


Nearest Neighbor (BlastX vs. Non-Redundant Proteins)


SEQ
ID


ACCESSION


DESCRIPTION


P VALUE


ACCESSION


DESCRIPTION


P VALUE


76




M.musculus MFH-1
gene


4.8 


<NONE> 


<NONE> 




77 




Borrelia burgdorferi 
left chromosomal 
subtelomeric region 


4.8 


4218141 


(AJ236702) HMR1 protein 
[Antirrhinum majus]


H 


78 


U02486


Mus musculus LAF
putative membrane
protein (KRAG)
gene, exon 3 and
complete cds


4.8


3258103 


(AP000006) 367aa long 
hypothetical protein 
[Pyrococcus horikoshii] 


2.7 


79 


AB000280 


Rat mRNA for
peptide/histidine
transporter, complete 
cds 


4.8 


806317 


(M29067) unknown protein 
[Saccharomyces cerevisiae]


0.001 


80


Z49771


A.cepa mitochondrial
gene for NADH
dehydrogenase
subunit 3 and
ribosomal protein
S12


4.5


<NONE> 


<NONE> 


<NONE> 


81 


M63494 


Mouse IgG receptor 
(beta-Fc-gamma-RII)
gene, exons 6 and 7,
clones lambda-

Fc(3,2.93).


4.3


<NONE> 


<NONE> 


<NONE> 


82 


Z14035 


S.pombe car1 gene


2.0 


3790665 


(AF099000) No definition line 
found [Caenorhabditis elegans]


1.2 


83


U17129


Rhodococcus
erythropolis ThcA
(thcA) gene, complete
cds; and unknown
genes


2.0


2828280


(AL0216^7) putative protein
[Arabidopsis thaliana]
>gi|2832633|gnl|PID|e1249651
(AL021711) putative protein
[Arabidopsis thaliana]


2e-26 


84


AE001386


Plasmodium
falciparum
chromosome 2,
section 23 of 73 of
the complete
sequence


2.0


4176500


(AL03U77) dJ889M15.3 (novel
protein)


9e-59 


85


U79292


Human clone 23734
mRNA sequence


1.9 


<NONE> 


<NONE> 


<NONE> 


86


V00159


Chloroplast Euglena
gracilis gene coding
for the 5S and 16S
rRNA.


1.9 


<NONE> 


<NONE> 


<NONE> 



Nearest Neighbor (BlastN vs. Genbank)


Nearest Neighbor (BlastX vs. Non-Redundant Proteins)


SEQ
ID


ACCESSION


DESCRIPTION


P VALUE


ACCESSION


DESCRIPTION


P VALUE


87




Xenopus laevis XL-
INCENP (XL-
INCENP) mRNA,
complete cds


1.9 




<NONE>


<NONE> 


88 


X93206


H.salinarium TATA
box-binding protein
genes and ORFs


1.9 


<NONE> 


<NONE> 


<NONE> 


89


U60979


Caenorhabditis
elegans programmed
cell death specifier 
(ces-2) gene, 
complete cds 


1.9 


<NONE> 


<NONE>


<NONE>


90 


X56272


C. tentans ORFs (A- 
E) for hemoglobin 


1.9 


<NONE> 


<NONE>


<NONE> 


91 


L22383


Homo sapiens DNA 
sequence, repeat 
region. 


1.9 


<NONE> 


<NONE> 


<NONE> 


92 


U82814


Hirudo medicinalis
neuron-specific 
protein mRNA, 
complete cds 


1.9 


3822533 


(AF094531) immunoglobulin 
heavy chain precursor 


2.0 


93 


U18504


Haplomitrium
hookeri 18S rRNA
gene, partial
sequence.


1.9


1083969


hypothetical protein 6 - fowlpox
virus virus]


2.0 


94


X53676 


Pseudomonas stutzeri 
nosDFY genes 
involved in copper 
processing


1.9 


2980781 


(AL022198) putative protein


0.70 


95


U60086


Dictyostelium
discoideum multidrug
resistance
transporter/Ser
protease (tagC)
mRNA, complete cds.


1.9


3879530


(Z49130) cDNA EST
yk486b9.3 comes from this
gene; cDNA EST yk486b9.5
comes from this gene


6e-05 


96


U33447


Human putative G-
protein-coupled
receptor (GPR17)
gene, complete cds


1.9


3880034


(Z75550) similar to cell division
control protein [Caenorhabditis
elegans]


7e-14


97


M81327


Sus scrofa lactoferrin
mRNA, complete cds.
>gb|I28421|I28421
Sequence 5 from
Patent US 5571691


1.8


<NONE>


<NONE>


<NONE>
PCT/US00/18374





Nearest 


Neighbor (BlastN vs. Genbank) 


Nearest Neighbor (BlastX vs. Non-Redundant Proteins)


SEQ
ID


ACCESSION


DESCRIPTION


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 






98


Y07622


S.iniae lctP & lctO
genes and ORF1


1.8


<NONE> 


<NONE>


<NONE> 


99 


M60474 


Mouse myristoylated 
alanine-rich C-kinase
substrate (MARCKS)
mRNA, complete cds


1.8 


<NONE> 


<NONE>




100 


Y13901 


Homo sapiens FGFR- 
4 gene 


1.8 


<NONE> 


<NONE> 


<NONE> 


101 


U44400 


Human Down 
Syndrome region of 
chromosome 21, 
clone A3 1D6-1D6. 


1.8


<NONE> 


<NONE> 


<NONE> 


102 


U92808 


Ruminococcus albus 
beta-glucosidase 
(gluA) mRNA,
complete cds 


1.8 


<NONE> 






103 


L25051 


Candida albicans 
argininosuccinate 
lyase (ARG4) gene, 
complete cds. 


1.8 


<NONE> 


<NONE> 


<NONE> 


104 


AE000546 


Helicobacter pylori 
26695 section 24 of 
134 of the complete 
genome 


1.8 


<NONE> 


<NONE> 


<NONE> 


105 


J00978 


Xenopus laevis major 
beta-globin gene,
complete cds.


1.8 


<NONE> 


<NONE> 


<NONE> 


106 


U41716 


Human 

immunodeficiency 
virus type 1 isolate 
rW95-5, vpr gene,
complete cds. 


1.8 


<NONE> 


<NONE> 


<NONE> 


107 


X66^86 i 


G.gallus mRNA for
tensin


1.8 


<NONE> 


<NONE> 


<NONE> 


108 


U76636


Xenopus calbindin
D28k mRNA,
complete cds


1.8 


<NONE> 


<NONE> 


<NONE> 


109 


J00664


Rabbit embryonic beta
l-globin gene.


1.8


<NONE> 


<NONE> 


<NONE> 


110


M21535


Human erg protein
(ets-related gene)
mRNA, complete cds.


1.8


2983160


(AE000693) hypothetical
protein [Aquifex aeolicus]


7.7 
Nearest Neighbor (BlastN vs. Genbank)


Nearest Neighbor (BlastX vs. Non-Redundant Proteins)


SEQ
ID


ACCESSION


DESCRIPTION


P VALUE


ACCESSION


DESCRIPTION


P VALUE


111


M80829


Rat troponin T
cardiac isoform gene,
complete cds


1.8


999450


(Z46595) incomplete interleukin
11 receptor isoform [Homo
sapiens]


7.3 


112 


D37887 


Cyprinus carpio c- 
myc gene for c-Myc,
complete cds 


1.8 


3023408 


BRANCHED-CHAIN AMINO 
ACID TRANSPORT SYSTEM 
CARRIER PROTEIN 
(BRANCHED-CHAIN AMINO 
ACID UPTAKE CARRIER) 
>gi|1075007|pir||D64056 
membrane-associated 
component, branched amino 
acid transport system (brnQ)
homolog - Haemophilus 
influenzae (strain Rd KW20) 
system II carrier protein (brnQ) 




113


AF019765


Homo sapiens G
protein-coupled
receptor kinase 1 and
G protein-coupled
receptor kinase 1b
(GRK1) gene,
alternatively spliced,
alternative exon 6,
exon 7, and partial
cds


1.8


498643


(U10270) G-box binding factor
1 [Zea mays]


7.2 




114


AF02:>967


Helicobacter pylori
J166 virulence
regulon
transcriptional
activator homolog
gene, partial cds,
strain-specific
genomic sequence B2


1.8


3850108


(AL033388) putative calcium-
transporting atpase
[Schizosaccharomyces pombe]


5.7 


115 


3 
( 

U13183 c 


Cenopus laevis 
Xwnt-4) mRNA, 
omplete cds. 


1.8 


2494853


PROBABLE
HYDROXYACYLGLUTATHIONE
HYDROLASE
(GLYOXALASE II) (GLX II)
protein [Escherichia coli]
>gi|1786406 (AE000130)
probable
hydroxyacylglutathione
hydrolase [Escherichia coli]


5.5



Nearest Neighbor (BlastN vs. Genbank)


Nearest Neighbor (BlastX vs. Non-Redundant Proteins)


SEQ
ID


ACCESSION


DESCRIPTION


P VALUE


ACCESSION


DESCRIPTION


P VALUE


116 


S68944 


Na+/Cl(-)-dependent

neurotransmitter 

transporter 


1.8 


2276316 


(Z96810) GLYT-1 LIKE [Homo
sapiens]


5.5 


117 


M92905 


Rat calcium channel 
alpha-1 subunit (rbB-
I) mRNA, complete
cds. 


1.8 


3165522 


(AF067607) Similar to cuticular 
collagen; C18H7.3 


5.5 


118 


X12429


Xenopus laevis U1
70K gene exon 10 


1.8 


2735957 


(AF015685) reverse 
transcriptase domain protein 


3.3


119 


D83333 


Mouse hepatitis virus 
genomic RNA for 
spike protein, partial 
cds 


1.8


3876559 


\±oivi 1} jwuijiny \t) uunmii— 
cyclin A/CDK2-associated
protein P19 (RNA polymerase
elongation factor)
(SW:SKP1_HUMAN); cDNA
EST EMBL:T00114 comes
from this gene; cDNA EST
yk390f11.5 comes from this
gene; cDNA EST yk402e11.5
co...
>gi|3877216|gnl|PID|e1346850
protein P19 (RNA polymerase
elongation factor) gene; cDNA
EST yk390f11.5 comes from
this gene; cDNA EST
yk402e11.5 co...


3.3


120 


AF016972 


Cervus elaphus 
REDDEER 
mitochondrial D- 
loop, complete 
sequence 


1.8


3878057 


(Z99942) similar to von 
Willebrand factor type A 
domain; cDNA EST yk412d4.5
comes from this gene; cDNA 
EST yk412d4.3 comes from this 
gene 


3.2 


121 


AB010741


Oncorhynchus mykiss
mRNA for rtSox24,
complete cds


1.8 


( 

( 

1730805 


tl X fKJ l IXt 1 1C AJL ZI.U ISJJ 

PROTEIN IN RPS3-PSDI 
INTERGENIC REGION
>gi|2132762|pir||S63129
probable membrane protein
YNL174w - yeast
(Saccharomyces cerevisiae)
>gi|1302152|gnl|PID|e23954S
(Z71451) ORF YNL174w
[Saccharomyces cerevisiae]


2.5 


122 


U32844


Haemophilus
influenzae Rd section
159 of 163 of the
complete genome


1.8


1 

\ 
i 

72S910 


A-TYPE INCLUSION
PROTEIN (ATI) camelpox
virus >gi|623Sl (X69774)
MkDa A-type inclusion protein
[unidentified]


1.9 
Nearest Neighbor (BlastN vs. Genbank)


Nearest Neighbor (BlastX vs. Non-Redundant Proteins)


SEQ
ID


ACCESSION


DESCRIPTION


P VALUE


ACCESSION


DESCRIPTION


P VALUE


123 


U18321


Human ionizing 
radiation resistance 
conferring protein 
mRNA. complete cds 


1.8 


2133273


ribosomal protein YS7 homolog
Emericella nidulans


1.4


124 


M28668 


Human cystic fibrosis 
mRNA, encoding a 
presumed 
transmembrane 
conductance regulator
(CFTR). > ::
gb|I11500|I11500
Sequence 1 from 
Patent US 5407796 


1.8


90492


filaggrin precursor - mouse
(fragment)


0.87


125 


AF064553 


Mus musculus NSD1
protein mRNA, 
complete cds 


1.8 


2501207


PROBABLE PROTEIN
DISULFIDE ISOMERASE P5
PRECURSOR >gi|1065461
(U40411) Similar to protein
disulfide-isomerase.
[Caenorhabditis elegans]


n a*7 


126 


AB002314


Human mRNA for
KIAA0316 gene,
complete cds 


1.8 


115131 1 


REGULATORY PROTEIN
BRLA (BRISTLE A PROTEIN)
>gi|83718|pir||A28913
regulatory protein brlA -
Emericella nidulans >gi|168029
(M20631) brlA protein


0.84 


127 


L42096


Homo sapiens
(subclone 10_d2 from
P1 H21) DNA
sequence.


1.8


2135624


metalloproteinase I (EC 3.4.24.-)
- human


0.65 


128 


M37278


R.norvegicus renin
gene, exons 1-9.


1.8


4050OS7


(AF109907) S164 [Homo
sapiens]


0.58 


129 


X82879


Artificial sequences
3MA Fnr APT ?
consensus


1.8


310929


(L13442) cysteine-rich extensin-
like protein-4 [Nicotiana
tabacum]


0.52


130 


D89729


Homo sapiens mRNA
for CRM1 protein,
complete cds


1.8


3559944


(AJ010792) Muc5AC protein
[Mus musculus]


0.38


131 


U78076


Mus musculus
sepiapterin reductase
gene, exons 1 and 2


1.8


2984225


(AE000766) enolase-
phosphatase E-1 [Aquifex
aeolicus]


0.095 



Nearest Neighbor (BlastN vs. Genbank)


Nearest Neighbor (BlastX vs. Non-Redundant Proteins)


SEQ
ID


ACCESSION


DESCRIPTION


P VALUE


ACCESSION


DESCRIPTION


P VALUE




132


X52133


Paramecium 168G 
gene for 168G 
surface protein 


1.8 


115316


COLLAGEN ALPHA 1(VIII)
CHAIN PRECURSOR
(ENDOTHELIAL
COLLAGEN)
>gi|105686|pir||S15435 collagen
alpha 1(VIII) chain precursor -


0.073


133 


M77830 


Human desmoplakin I 
mRNA, complete cds. 


1.8


1397246


IU6W44) coded for by C.
elegans cDNA yk112f3.5; coded
for by C. elegans cDNA
cm21d2; coded for by C.
elegans cDNA CEESR07F;
coded for by C. elegans cDNA
yk112f3.3; coded for by C.
elegans cDNA CEESR29F
[Caenorhabditis elegans]


1e-04


134 


AJ224150 


Plasmodium berghei
EF-1 alpha A-gene


1.8


1353761


(U43192) myosin II heavy chain
[Naegleria fowleri]


2e-05


135 


AJ005518 


Mus musculus 
somatostatin receptor 
2 gene, exon 1 and 5'
flanking region


1.8


1326350 


(U58748) similar to potential 
transmembrane domains in S. 
cerevisiae nuclear division
RFT1 protein (SP:P38206)


2e-08 


136


AF002217


Ralstonia eutropha
megaplasmid pHG1
nitric oxide reductase
(norB) gene,
complete cds


1.8


3393018


(AL031174) hypothetical
protein


2e-08


137


AF039035


Caenorhabditis
elegans cosmid
C53A3


1.8


3850109


(AL033388) 3-oxoacyl-[acyl-
carrier-protein]-synthase


3e-11


138


M81769


G.domesticus
immunoglobulin
rearranged gamma
chain mRNA, VJC
region, complete cds.


1.8


30S0527


(AL022600) putative mannose-1
phosphate guanyl transferase
[Schizosaccharomyces pombe]


3e-14 


139


Y11106


P.pastoris PYC1 gene


1.8


1175412


HYPOTHETICAL 24.2 KD
PROTEIN C13A11.03 IN
CHROMOSOME I >gi|984224
(Z54096) unknown


1e-15


140


U87803


Human putative
Ca2+/calmodulin-
dependent protein
kinase kinase gene, 3'
flanking region,
partial sequence


1.8


2S2S280


(AL0216S7) putative protein
[Arabidopsis thaliana]
>gi|2S32633|gnl|PID|e1249651
(AL021711) putative protein
[Arabidopsis thaliana]


3e-17 
Nearest Neighbor (BlastN vs. Genbank)


Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 


SEQ 
ID 


ACCESSION 


DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 




149


X74142


H.sapiens HBF-1
mRNA for
transcription factor


1.7


<NONE> 


<NONE> 


<NONE> 


150 


U40830 


Streptococcus
thermophilus DeoD
gene, partial cds, and
EpsA, EpsB, EpsC,
EpsD, EpsE, EpsF,
EpsG, EpsH, EpsI,
EpsJ, EpsK, EpsL,
EpsM, Orf14.9
protein genes,
complete cds


1.7


<NONE>


<NONE> 


<NONE> 


151 


L29172 


Rabbit Ig germline
gamma H-chain
(allotype d12,e15) C-
region gene, 3' end.


1.7 


<NONE> 


<NONE> 


<NONE> 




152


M19045


Human lysozyme 
mRNA, complete cds. 


1.7 


<NONE> 


<NONE> 


<NONE> 


153 


AE001159


Borrelia burgdorferi 
(section 45 of 70) of 
the complete genome 


1.7


<NONE> 


<NONE> 


<NONE> 


154 


L17027 


plasmid pWA (from
Fremyella
diplosiphon) DNA 
sequence, including 
unidentified cds and 
stem loop. 


1.7 


<NONE> 


<NONE> 


<NONE> 


155 


U12232


Arabidopsis thaliana
Columbia GTP
binding protein beta
subunit (AGB1)
mRNA, complete cds.


1.7


<NONE> 


<NONE> 


<NONE> 


156 


D42056 


Arabidopsis thaliana 
ATPK6 mRNA for 
ribosomal-protein S6 
kinase homolog. 
complete cds 


1.7 


<NONE> 


<NONE>


<NONE>


157 


X9SU7 


Rhizobium 
leguminosarum prsD, 
prsE, ORF3 genes 


1.7 


<NONE> 


<NONE> 


<NONE>
Nearest Neighbor (BlastN vs. Genbank)


Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 


SEQ 
ID 


ACCESSION 


DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 
















158 


AF039084 


Spinacia oleracea 
heat shock 70 protein 

nrfitein cnmnlcte ccfc 


1.7 


<NONE> 


<NONE> 


<NONE> 


159 


Z12651


R.norvegicus gene for 
catechol 

methyltransferase 


1.7 


<NONE> 


<NONE> 


<NONE> 


160 


AF002968 


Fringilla coelebs
mitochondrial control
region, partial
sequence 


1.7 


<NONE> 


<NONE> 


<NONE> 


161 


AE001160 


Borrelia burgdorferi
(section 46 of 70) of 
the complete genome 


1.7 


<NONE> 


<NONE> 


<NONE> 


162 


U67553 


Methanococcus 
jannaschii section 95
of 150 of the
complete genome


1.7


<NONE> 


<NONE> 


<NONE> 


163 


M86247 


S.ruminantium
plasmid nS^ DNA


1.7 


<NONE> 


<NONE> 


<NONE> 


164 


S74436 


oIL-8=interleukin-8
[sheep, spleen cells,
mRNA, 1435 nt]


1.7


<NONE> 


<NONE> 


<NONE> 


165 


D12719


Candida maltosa 
ALK7 (CYP52A10) 
and ALK8 complete 
cds 


1.7 


<NONE>


<NONE> 


<NONE> 


166 


U02625 


Geotrichum 
candidum NRRL Y- 
553 lipase gene, 
partial cds. 


1.7 


321245 


230k bullous pemphigoid 
antigen BPM1 - mouse 


9.3


167 


Z58881 


H.sapiens CpG DNA,
clone 114a4, reverse
read cpg114a4.rt1a.


1.7


1854675 


(U66298) bone morphogenetic 
protein-6 [Rattus norvegicus]


9.1 


168 


U43674 


Agrobacterium 
tumefaciens conjugal 
transfer region 1 
genes 


1.7 


1352066 


LARGE PROLINE-RICH
PROTEIN BAT2 MHC class III
histocompatibility antigen HLA-
B-associated transcript 2 -
human >gi|179339 (M33509)
HLA-B-associated transcript 2
(BAT2) [Homo sapiens]
>gi|179345 (M33518) HLA-B-
associated transcript 2 (BAT2)
[Homo sapiens]


9.1
Nearest Neighbor (BlastN vs. Genbank)


Nearest Neighbor (BlastX vs. Non-Redundant Proteins)


SEQ
ID


ACCESSION


DESCRIPTION


P VALUE


ACCESSION


DESCRIPTION


P VALUE


169


AL023827


Caenorhabditis
elegans cosmid
Y12A6A, complete
sequence
[Caenorhabditis
elegans]


1.7


731440


PROTOPORPHYRINOGEN
OXIDASE (PPO) yeast
(Saccharomyces cerevisiae)
>gi|o0360o (U18778) Hem14p:
protoporphyrinogen oxidase
[Saccharomyces cerevisiae]
>gi|1403536|gnl|PID|e249333
(Z71381) protoporphyrinogen
oxidase [Saccharomyces
cerevisiae]


8.9


170 


X69662 


X.laevis mRNA for
glutathione
synthetase, large
subunit


1.7


4038057 


(AC005897) hypothetical 
protein [Arabidopsis thaliana] 


8.8 


171 


Z35824 


S.cerevisiae
chromosome II
reading frame ORF
YBL063W


1.7


3021450


(Y15515) prdl-a [Hydra
vulgaris]


7.0 


172 


M65139 


Cowpea chlorotic 
mottle virus (CCMV) 
1a protein gene,
complete cds.


1.7


2506307


COLLAGEN ALPHA 1(XII)
CHAIN PRECURSOR 1(XII)
chain - chicken
>gi|222811|gnl|PID|d1001160
gallus]
>gi|2326442|gnl|PID|e39435
(X61024) collagen type XII
alpha 1 chain [Gallus gallus]


7.0 


173 


X15065


Drosophila distal BX-
C region (bithorax
complex) pH189 5'
region


1.7


1723625


HYPOTHETICAL IttO KD
PROTEIN IN ALPA-GABD
INTERGENIC REGION (F87)
>gi|1033124 (U36840)
ORF_f87 [Escherichia coli]
>gi|1788982 (AE000348) orf,
hypothetical protein


6.9 



WO 01/02568 



PCT/US00/18374





Nearest Neighbor (BlastN vs. Genbank)


Nearest Neighbor (BlastX vs. Non-Redundant Proteins)


SEQ
ID


ACCESSION


DESCRIPTION


P VALUE


ACCESSION


DESCRIPTION


P VALUE


174 


Z46255


S.cerevisiae
chromosome VI
lambda clone.


1.7


3875228


(Z46792) similar to lethal(1)
discs large-1 tumor suppressor
protein-like repeats; cDNA EST
EMBL:D33495 comes from this
gene; cDNA EST
EMBL:D35117 comes from this
gene; cDNA EST
EMBL:D36356 comes from this
gene; cDNA EST EMB...
>gi|3879984|gnl|PID|e1351767
suppressor protein-like repeats;
cDNA EST EMBL:D33495
comes from this gene; cDNA
EST EMBL:D35117 comes
from this gene; cDNA EST
EMBL:D36356 comes from this
gene; cDNA EST EMB...


6.7


175 


U01066 


Human CD4 
promoter, partial 
sequence. 


1.7 


125448 


THYMIDINE KINASE 
saimiriine herpesvirus 1 (strain 
11 [One]) >gi|6034t 


6.7 


176 


U34743 


Phalaenopsis sp.
'hybrid SM9108'
homeobox protein
mRNA, complete cds


1.7 


1022918 


(U38184) ATPase subunit 6 
[Trypanosoma cruzi]


6.7 


177 


U14662


Baboon herpesvirus 
HVP2 gB 

glycoprotein (UL27) 
gene, complete cds. 


1.7 


3218378 


(AL023862) hypothetical 
protein SC3F9.07 [Streptomyces 
coelicolor]


6.7 


178 


AB017006 


Homo sapiens 
PMS2L15 mRNA, 
partial cds 


1.7 


1465855


(U64859) glutamine-rich protein
[Caenorhabditis elegans]


6.7


179 


U92651


Brassica oleracea var.
botrytis tonoplast
intrinsic protein
bobTIP26-1 mRNA,
complete cds


1.7 


3023675


DYNEIN HEAVY CHAIN,
CYTOSOLIC (DYHC) dynein
heavy chain
[Schizosaccharomyces pombe]


6.6 


180 


AF000634


Lytechinus variegatus
Notch homolog
mRNA, complete cds


1.7


148574


(M58520) endo-1,4-beta-
glucanase [Fibrobacter
succinogenes]


6.6
Nearest Neighbor (BlastN vs. Genbank)


Nearest Neighbor (BlastX vs. Non-Redundant Proteins)


SEQ
ID


ACCESSION


DESCRIPTION


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 


181 


M92354 


Arabidopsis thaliana
anthranilate synthase
alpha subunit gene,
complete cds


1.7 


738308 


blue light photoreceptor 
[Arabidopsis thaliana]


6.5 


182 


AJ234856 


Hordeum vulgare

genomic DNA 
Fragment; clone 
MWG2234.rev 


1.7 


3142302 


(AC002411) Strong similarity to
myosin heavy chain gb|Z34293
from A. thaliana. [Arabidopsis
thaliana] 


6.5 


183 


U76827 


Sterconinus 

paio&iiJvUo vim jjj 

cytochrome b protein, 
partial cds 


1.7 


3413810 


(Y17034) Bassoon [Mus
musculus]


5.4 


184 


tins'? 1 1 


Saccharomyces 
cerevisiae Ttplp 
(TTP1) gene, 
complete cds. 


1.7


403173 


(L24492) lipoprotein 
[Rhodococcus erythropolis] 


4.9


185 




Homo sapiens 
TRRAP protein 
(TRRAP) mRNA, 
complete cds


1.7


1170140


PUTATIVE
ENDOGLUCANASE TYPE K
PRECURSOR (ENDO-1,4-
BETA-GLUCANASE)
(CELLULASE)


4.1 


186


AE000753


Aquifex aeolicus
section 85 of 109 of
the complete genome


1.7


U69357 


DNA ADENINE METHYLASE 
site-specific DNA- 
methyltransferase (adenine-
specific) dam methylase gene 
product [Vibrio cholerae] 


4.0 


187 




Tupaia glis 
apolipoprotein AI 
prepropeptide 
iubjt f\ t complete cds


1.7




(AL031124) putative secreted
lyase 


4.0 


188 


M23090 


Human germ line IgK 
chain gene V3-region,
clone Humkv328h5


1.7


2257483


(AB004534) pi003
[Schizosaccharomyces pombe]


4.0


189 


M24001


Mink enteritis virus
antigenic type 2
capsid protein genes
VP1 and VP2,
complete cds.


1.7


2143504


myotonic dystrophy kinase -
mouse (fragment) kinase, DM-
kinase (C-terminal, alternatively
spliced, clone delta ILIIIJV.VJ 
mice, brain, Peptide Partial, 
174 aa) [Mus sp.] 


3.9 


190 


X59964


H.sapiens CST4 gene
for Cystatin D


1.7


1766075


(U37273) winged helix protein
CWH-2 [Gallus gallus]


3.1 



Nearest Neighbor (BlastN vs. Genbank)


Nearest Neighbor (BlastX vs. Non-Redundant Proteins)


SEQ
ID


ACCESSION


DESCRIPTION


P VALUE


ACCESSION


DESCRIPTION


P VALUE
















191 


X95276 


P.falciparum
complete gene map of
plastid-like DNA (IR- 
B) 


1.7 


3219951 


PROTEIN C6B12.13 IN
CHROMOSOME I 

>gi|2330843|gnl|PID|e334047 
pombe] 


3.0 


192 


D84487 


Rat PMSG-induced 
ovarian mRNA, 
3' sequence, N10


1.7 


173164 


(J02719) valyl-tRNA synthetase
[Saccharomyces cerevisiae]


2.3 


193 


L14851 


Rattus norvegicus
neurexin III-alpha
gene, complete cds. 


1.7 


3323586 


(AF060869) single-strand 
binding protein [Salmonella 
typhimurium] 


2.3


194 


M97002 


Xenopus laevis/gilli 
hybrid pseudo-IgH 
chain gene, V region, 
clone LG7G342A- 


1.7


2118407 


MHC sex-limited protein - 
mouse (fragment) musculus] 


2.3 


195 


L07025 


Bacillus thuringiensis
delta-endotoxin
\\^iy r\{ajj gene,
end. > ::
gb|I34520|I34520
Sequence 1 from
patent uj> j^you/i >
gb|I39790|I39790
Sequence 1 from
patent US 5616495 >
gb|AR008487|AR008
487 Sequence 1 from
patent US 5753492


1.7


2496940 


HYPOTHETICAL 53.4 KD 
PROTEIN D1054.13 IN
CHROMOSOME V
>gi|3875316|gnl|PID|e1344967


1.8 


196 


S73149 


insulin-like growth
factor II {intron 7}
[human, Genomic,
1702 nt]


1.7


3327038


(AB014512) KIAA0612 protein
[Homo sapiens]


1.8 


197 


D86990


Human (lambda)
DNA for
immunoglobulin light
chain


1.7


494367


Fv Fragment (Murine Se155-4)
Complex With The
Trisaccharide: Alpha-D-
Galactose(1-2)[alpha-D-
Abequose(1-3)]alpha-
Mannose (P1-Ome) (Part Of
The Cell-Surface Carbohydrate
Of Pathogenic Salmonella)


1.8
Nearest Neighbor (BlastN vs. Genbank)


Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 


SEQ 
ID 


ACCESSION 


DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 






198


L17027


plasmid pWA (from
Fremyella
diplosiphon) DNA
sequence, including
unidentified cds and
stem loop.


1.7 



1082702 


poliovirus receptor-related 
protein - human 


1.4


199 


AL022273 


Caenorhabditis 
elegans cosmid 
H22D14, complete 
sequence 
[Caenorhabditis
elegans]


1.7


3924605 


(AF069442) putative inhibitor 
of apoptosis [Arabidopsis
thaliana]


1.4


200 


U89926 


Drosophila 
melanogaster cut
gene, partial sequence


1.7


2245100 


(Z97343) DNA-binding protein 
homolog 


1.3 


201 


Z25749 


H.sapiens gene for 
ribosomal protein S7


1.7 


2493459 


PROTEIN KINASE C 
SUBSTRATE, 60.1 KD 
PROTEIN, HEAVY CHAIN
(PKCSH) (80K-H PROTEIN)
>gi|1215746


U 


202 


U59S41 


Fundulus heteroclitus
lactate dehydrogenase 
B 


1.7 


30055S7 


(AF048977) Ser/Arg-related
nuclear matrix protein [Homo
sapiens]


0.82 


203 


X55763 


Rabbit mRNA for 
smooth muscle 
calcium channel 
blocker (CaCB) 
receptor 


1.7 


3883128 


(AF082302) arabinogalactan- 
protein [Arabidopsis thaliana] 


0.82 


204 


Z75528 


Caenorhabditis 
elegans cosmid 
C18B12A, complete 
sequence 
[Caenorhabditis 
elegans] 


1.7 


940397 


(D10123) core [Hepatitis C 
virus] 


0.80


205 


U50912 


Human XIST gene,
polypurine-
pyrimidine repeat
region


1.7


2338027 


(AF005370) large tegument 
protein [Alcelaphine herpesvirus 
1]


0.59


206 


X12817 


Ovis aries beta-
lactoglobulin gene


1.7 


987050 


(X65335) lacZ gene product 
[unidentified cloning vector] 


0.45 


207 


AF004419 


Homo sapiens 
troponin T (TNNT2) 
gene, exon 13


1.7 


2996364 


(AF053947) unknown [Yersinia 
pestis] >gi|3883090


0.22


208 


L43643 


Gallus domesticus
DNA microsatellite
marker MCW119 


1.7 


464896 


TRANSDUCIN-LIKE
ENHANCER PROTEIN 1
enhancer-of-split homolog TLE-
1 - human >gi|307510


0.20
Nearest Neighbor (BlastN vs. Genbank)


Nearest Neighbor (BlastX vs. Non-Redundant Proteins)


SEQ
ID


ACCESSION 


DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 
















209 


Z73278


S.cerevisiae 
chromosome XII 
reading frame ORF 
YLR106C 


1.7 


1351657 


HYPOTHETICAL 123.9 KD
PROTEIN C30D11.04C IN
CHROMOSOME I
>gi|2130411|pir||S62562
hypothetical protein
SPAC30D11.4c - fission yeast
nuclear pore complex protein
[Schizosaccharomyces pombe]


0.20 


210 


M22345 


Mouse endogenous 
provirus gag, pol, and
env region DNA. 


1.7 


2444455 


(AF020765) hypothetical 
protein [Myxococcus xanthus]


0.12 


211 


AE000360 


Escherichia coli K-12
MG1655 section 250
of 400 of the
complete genome


1.7


2736361 


(AF039038) No definition line 
found [Caenorhabditis elegans] 


0.12 


212 


AB020692 


Homo sapiens mRNA 
for KIAA0885 
protein, complete cds 


1.7


2605924


(AF029726) histidine kinase C
[Dictyostelium discoideum] 


0.094 


213 


S69429 


testis-determining 
gene/SRY homolog 
[Sminthopsis 
macroura=striped- 
faced dunnarts,
Genomic, 855 nt] 


1.7 


2499016 


TONB PROTEIN >gi|1666536
(U23764) TonB [Pseudomonas
aeruginosa]


0.092


214 


S69429 


testis-determining 
gene/SRY homolog 
[Sminthopsis 
macroura=striped- 
faced dunnarts,
Genomic, 855 nt]


1.7


2499016


TONB PROTEIN >gi|1666536
(U23764) TonB [Pseudomonas 
aeruginosa] 


0.088 


215 


U67205 


Mus musculus ACF7 
neural isoform 3 
(mACF7) mRNA, 
partial cds 


1.7




(AF000198) weak similarity to
rlorVU [Caenorhabditis elegans]


0.052


216 


X98188 


Artificial DNA 
sequence for 
mammalian lambda- 
neo minichromosome, 
1400 bp 


1.7 


2493779 


PUTATIVE CUTICLE 
COLLAGEN C09G5.6 
collagen; cDNA EST yk244c3.5 
comes from this gene; cDNA 
EST yk244c3.3 comes from this
gene [Caenorhabditis elegans]


0.042 


217 


U70139 


Mus musculus 
putative CCR4 
protein mRNA. 
partial cds 


1.7 


2252630 


(U95973) hypothetical protein 
[Arabidopsis thaliana] 


0.041






Nearest Neighbor (BlastN vs. Genbank)


Nearest Neighbor (BlastX vs. Non-Redundant Proteins)


SEQ
ID


ACCESSION


DESCRIPTION


P VALUE


ACCESSION


DESCRIPTION


P VALUE


218 


L38808 


Homo sapiens alpha-
1 type V collagen
(COL5A1) gene, 5'
flank and exon 1.


1.7


2895760


\tw*j£**o) universal minicircle
sequence binding protein
minicircle sequence binding
protein [Crithidia fasciculata]


0.039


219 


Z72151 


B.napus mRNA for 
AMP-binding protein


1.7


190475


(K02576) salivary proline-rich 


0.011 


220 


X94152


R.norvegicus mRNA
for cysteine sulfonate
decarboxylase


1.7


2136212


synapsin IIb - human
>gi|1594277 (U40215) synapsin
IIb [Homo sapiens]


0.008 


221 


L20255 


Mouse stathmin gene
sequence.


1.7


2317914


(U97553) unknown [murine


0.006


222


L13600


Rattus norvegicus
glycine transporter
mRNA, complete cds.


1.7


11(\AIW


(U23175) similar to anion
exchange protein
[Caenorhabditis elegans]


0.003


223


AJ224150


Plasmodium berghei 
EF-1 alpha A-gene


1.7 


2072290 


(U95094) XL-INCENP 
[Xenopus laevis]


0.001 


224 


S80642 


butyrophilin [mice, 
lactating mammary 
gland, mRNA Partial, 
3193 nt]


1.7 


2695746 


(AJ2230t0) Pmi2
[Schizosaccharomyces pombe]


9e-04


225 


M22363 


C.elegans unc-86 
gene encoding two 
alternative proteins, 
complete cds. 


1.7


2224683


(AB002369) KIAA0371 [Homo
sapiens]


1e-04


226 


X92123 


M.musculus cgt gene 
exon 1 


1.7


3874232


(Z49909) similar to Prokaryotic
ribonuclease PH
[Caenorhabditis elegans]


3e-05 


227 


AB016000


Ipomoea nil PKn2
(knotted-like gene)
mRNA, complete cds


1.7


2183083


(AF000422) TTF-I interacting
peptide 5 [Homo sapiens]


1e-05


228 


D14L33


Bovine mRNA for
synaptocanalin I


1.7


392527.7


ALUi264j) similar to
Uncharacterized protein family
UPF0034, Double-stranded
RNA binding motif; cDNA EST
yk489b3.5 comes from this
gene; cDNA EST yk439g7.5
comes from this gene
[Caenorhabditis elegans]


2e-06 
Nearest Neighbor (BlastN vs. Genbank)


Nearest Neighbor (BlastX vs. Non-Redundant Proteins)


SEQ
ID


ACCESSION 


DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 














229 


L01991 


Mus musculus TAFG- 
I -like neuronal 
glycoprotein (PCS) 
mRNA, complete cds. 


1.7


3006139 


(AL022299) hypothetical 
protein 


4e-07 


230 


X63016 


Tomato yellow leaf 
curl virus Thailand
isolate complete 
genome (TYLCV-TH 
B-DNA) 


1.7 


3643608 


(AC005395) hypothetical 
protein [Arabidopsis thaliana] 


1e-07


231 


Z22802 


H.sapiens
microsatellite repeat

> :: 

gb|G34562|G34562 
human STS SHGC- 
51834 


1.7 


100210 


extensin precursor (clone Tom L 
4) - tomato esculentum]


4e-09 


232 


K02765 


Human complement 
component C3 
mRNA, alpha and 
beta subunits,
complete cds.


1.7


2984320 


(AB000773) acetoin utilization 
protein [Aquifex aeolicus] 


1e-09


233 


Z74818 


S.cerevisiae
chromosome XV 
reading frame ORF 
YOL076w 


1.7 


3873700 


(1.750X1) predicted using
Genefinder; Similarity to
Bacillus subtilis DNAJ protein
gene; cDNA EST
EMBL:C12520 comes from this
gene; cDNA EST
EMBL:D71409 comes from this
ge...


7e-11


234 


D21871 


Pig mRNA for thimet 
oligopeptide 


1.7 


2632098 


(Y15513) Prodos protein
[Drosophila melanogaster]


8e-13


235


Y14344


Gallus gallus gene
encoding neurofascin,
exons 9, 10, 11 & 12


1.7 


3876421 


jLo 1 \)1\>) cDNA EST
EMBL:C12730 comes from this
gene; cDNA EST yk200b6.5
comes from this gene; cDNA
EST yk349a12J comes from
this gene [Caenorhabditis
elegans]


3e-14 


236 


Z73608 


S.cerevisiae
chromosome XVI 
reading frame ORF 
YPL252C 


1.7 


1439663 


(U64605) C05D9.6 gene 
product [Caenorhabditis 
elegans] 


6e-18



Nearest Neighbor (BlastN vs. Genbank)


Nearest Neighbor (BlastX vs. Non-Redundant Proteins)


SEQ 
ID 


ACCESSION 


DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 












237


AG000518


Homo sapiens
genomic DNA, 21q
region, clone:
T17IN23


1.7


U74468


OLIGOSACCHARYL
TRANSFERASE STT3
SUBUNIT HOMOLOG
>gi|529357 (U13019) No
definition line found
[Caenorhabditis elegans]


6e-18


238 


D17716


Human mRNA for N-
acetylglucosaminyltransferase
V, complete
cds


1.7


961446 


(D63877) KIAA0157 gene 
product is novel. 


5e-19 


239 


AF102512


Cheilodactylus
vittatus country USA- 
Midway Island 
cytochrome c oxidase 
subunit I gene, 
mitochondrial gene 
encoding 
mitochondrial 
protein, partial cds 


1.7 


1572756 


(U70848) C43G2.1 gene
product [Caenorhabditis
elegans]


5e-40 


240 


L30107


Rattus norvegicus
liver-specific
transporter gene, 
promoter region. 


1.7 


4176443 


(AL022238)dJ1042K10.4 
(novel protein) 


3e-49


241 


X91220 


H.sapiens mRNA for 
Na-Cl electroneutral
thiazide-sensitive
cotransporter


1.7


3478637


(AC005S46) R29425_1 [Homo
sapiens] 


6e-54 


242 


U97146 


Rattus norvegicus 
calcium-independent 
phospholipase A2 
mRNA, complete cds 


1.6


<NONE> 


<NONE> 


<NONE> 


243 


Z48508 


Pea seed-borne 
mosaic virus RNA for 
coat protein and 
polymerase (partial) 


1.6 


<NONE> 


<NONE> 


<NONE> 


244 


M18349


Rat leukocyte
common antigen (L-
CA) gene, exons 1
through 5.


1.6 


<NONE> 


<NONE>


<NONE> 


245 


M13158


Yeast (S.pombe)
cdc25+ gene (mitosis
initiation), complete
cds.


1.6


<NONE>


<NONE> 


<NONE> 








Nearest Neighbor (BlastN vs. Genbank)


Nearest Neighbor (BlastX vs. Non-Redundant Proteins)


SEQ
ID


ACCESSION 


DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 






246


U39712


Mycoplasma
genitalium section 34
of 51 of the complete
genome


1.6 


<NONE> 


<NONE> 


<NONE> 


247 


M17922 


Mouse Murine 
urokinase-type 
plasminogen activator 
protein gene, 
complete cds. 


1.6 


3875750 


UiH99) predicted using 
Genefinder; cDNA EST
yk410e3.3 comes from this
gene; cDNA EST yk410e3.5
comes from this gene
[Caenorhabditis elegans]


8.0 


248 


M89986 


Human polymorphic 
loci in Xq28.


1.6


3261710


(Z84724) psd [Mycobacterium
tuberculosis] 


6.4 


249 


M89986 


Human polymorphic 
loci in Xq28. 


1.6 


2143805 


inositol-polyphosphate 4- 
phosphatase - rat 


6.2 


250 


U68725 


Rattus norvegicus
Deleted in colorectal 
Cancer 


1.6 


1256804 


(U51449) RING3 protein
[Xenopus laevis] 


5.8 


251 


X95199 


P.platessa GSTA, 
GSTA1, GSTA2, and
PPTN genes 


1.6 


3915113 


MALEYLACETATE
REDUCTASE Pseudomonas
cepacia >gi|643636 (U19883)
maleylacetate reductase
[Burkholderia cepacia]


4.9 


252 


Y09103 


D.melanogaster 
RPA1 gene 


1.6 


3916021 


HYP6TJffijiCAl9iKD 
PROTEIN IN COB INTRON
>gi|2654230|gnl|PID|e1192341
(X02819) unidentified reading
frame [Schizosaccharomyces
pombe]


4.8


253 


Z14078


T.aestivum
mitochondrion fMet,
18S, 5S repeat unit
DNA


1.6


2501668 


DYSTROPHIN-RELATED 
PROTEIN 2 sapiens] 


3.6 


254 


AB002314 


Human mRNA for 
KIAA0316gene, 
complete cds 


1.6 


130997 


REPETITIVE PROLINE-RICH
CELL WALL PROTEIN 1
PRECURSOR
>gi|81809|pir||A29324 proline-
rich protein precursor - soybean
>gi|170049 (J02746) proline-
rich protein [Glycine max] 


2.8 


255 


M21488


Human muscle
creatine kinase gene
(CKMM), exon 2.


1.6


119399


ENV POLYPROTEIN 
PRECURSOR (COAT 
POLYPROTEIN) [CONTAINS: 
COAT PROTEIN GP62; COAT 
PROTEIN GP40] 


2.2 
Nearest Neighbor (BlastN vs. Genbank)


Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 


SEQ 
ID 


ACCESSION 


DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 














256 




Borrelia burgdorferi
(section 50 of 70) of
the complete genome


1.6




(AF109907) hypothetical
protein [Homo sapiens]


1.5 


257 




M.musculus 
rearranged T-cell 
receptor beta variable 
region \ v c?i / a;


1.6




(AF080090) semaphorin IV
isoform b [Mus musculus]


1.2 


258 


M15346 


T.cruzi tandemly
repeated gene
encoding an 85 kDa
antigen with
homology to heat
shock proteins.


1.6


2804437


(AF043695) similar to zinc
metalloprotease family of
elegans]


0.41 


259 


L39018 


Rattus norvegicus 
sodium channel 
protein o lav-iroj 
mRNA, complete cds 


1.6 


2920535 


rAF0IR08n type XVIII
collagen [Homo sapiens]


0.037 


260 


M29483 


Human leukocyte
adhesion protein
p150,95 alpha subunit
gene, exons 7 - 15.


1.6 


1840045 


f"TT4Q08?^ transporter protein
[Homo sapiens]


2e-09 


261 


L06844 


Aspergillus niger beta-
fructofuranosidase
(suc1) gene, one
exon. 


1.6 


4206210 


(AF071527) putative calcium 
channel [Arabidopsis thaliana] 


9e-10 


262 


M10946


Chicken aldolase D 
gene, complete cds, 
clones lambda- 
C(U.1.4). 


1.6


2746775


(AF040640) similar to peptidase
family C19 (ubiquitin carboxyl-
terminal hydrolase)
[Caenorhabditis elegans]


1e-31


263 


X07881 


Human gene PRB3L 
for proline-rich 
protein G1


1.5


<NONE>


<NONE>


<NONE>


264 


U22260 


Nicotiana tabacum 
UMP synthase (pyr5- 
6) mRNA, partial cds


1.5


3880923 


(Z99271) similar to Reverse 
transcriptase comes from this 
gene [Caenorhabditis elegans] 


0.50 


265 


U76759 


Mus musculus 
nuclear protein 
NIP45 mRNA. 
complete cds 


1.4


1330394


(U58761) C01F1.6 gene product
[Caenorhabditis elegans]


8.9
Nearest Neighbor (BlastN vs. Genbank)


Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 


SEQ 
ID 


ACCESSION 


DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 












Rmas&iuM- 




266 


AFO76470 


Rice tun&ro 
bacilliform virus 
Serdang strain, 
complete genome 


1.4 


1703461 


TRANSPORTING ATPASE

BETA CHAIN (PROTON
PUMP) (GASTRIC H+/K+
ATPASE BETA SUBUNIT)
3.6.1.36) beta chain - human
>gi|184105 (M75110) H,K-
ATPase beta subunit [Homo
sapiens]


8.9 


267 


X64659 


P imzchus interferon 
gene for interferon 
gamma 


1.4 


1486485


(U28832) US10 [Gallid
herpesvirus 1] >gi|1486497


6.8 


268 


U11825


Schistosoma 
japonicum structural
muscle protein
paramyosin mRNA,
complete cds. 


0.88 


<NONE> 


<NONE> 


<NONE> 


269 


D84278 


Human DNA for

CD38, exon 1


0.68


3766363 


(AL031907) hypothetical serine 
rich protein 

[Schizosaccharomyces pombe]


3.0 


270 


M59755 


Bovine lens aldose
reductase
pseudogene, 3' end.


0.67 


<NONE>


<NONE> 


<NONE> 


271 


M81758 


Homo sapiens 
skeletal muscle 
voltage-dependent 
sodium channel alpha 
subunit (SkM1)
mRNA, complete cds. 


0.65 


2437819 


(Z86105) 1,4-beta-glucanase
[Anaerocellum thermophilum] 


3.6 


272 


L01965


Human type IV 
sodium channel alpha 
polypeptide 


0.64 


2437819 


(Z86105) 1,4-beta-glucanase 
[Anaerocellum thermophilum]


3.5


273 


U90122 


Danio rerio bone 
morphogenetic 
protein-4 (bmp4)
mRNA, partial cds 


0.63 


2983532 


(AE000720) formate 
dehydrogenase alpha subunit 
(Aquifex aeolicus] 


7.9


274 


L41624 


Hylobates lar mucin 
(MUC1) gene, exons 
1-6. 


0.63


1517808 


(D79215) FGM0 [Rattus 
norvegicus] 


0.91



(U6/9:>6) coded for by C.


275 


AF030881 


Fugu rubripes sushi 
retrotransposon gag 
polyprotein (gag) and 
pol polyprotein (pol) 
genes, complete cds 


0.63 


1519696 


elegans cDNA ylcl^o^ j; coded
for by C. elegans cDNA
yk159h6.3; coded for by C.
elegans cDNA yk!26<9.3; coded
for by C. elegans cDNA
yk159h6.5 [Caenorhabditis
elegans]


0.38 


276 


U52909 


Arabidopsis thaliana
U1 snRNP 70K
protein gene, 
complete cds 


0.62 


" <NONE> 


<NONE> 


<NONE> 


277 


AF008192 


Homo sapiens 
putative GR6 protein 
(GR6) mRNA, 
complete cds 


0.62 


3800934 


(AF100655) contains similarity 
to ser/thr protein kinases
[Caenorhabditis elegans]


9.7 


278 


Ur/081 


Human fatty acid 
binding protein 
(FABP3) gene,
complete cds 


0.62 


3617848 


(AF049709) tyrosylprotein
sulfotransferase-A; TPST-A


1.1 




A vim qia(\ 


Homo sapiens mRNA 
for KIAA0797
protein, partial cds 


0.62 


424044 


VP5 protein - porcine rotavirus 
>gi|61355 


7.7 






H.sapiens mRNA for
leukocyte adhesion
glycoprotein p150,95


0.62 


1054945 


(U38621) polyprotein [Tobacco 
vein mottling virus] 


4.5 


281 


M63138 


Human cathepsin D 
(catD) gene, exons 7,
8, and 9, 


0.62 


136810 


GLYCOPROTEIN M 
>gi|73791|pir||WMBE5l UL10 
protein - human herpesvirus 1 1- 
473) [Human herpesvirus 1] 
>gi|221732|gnl|PIDidl00213i 


3.5 


282 


X76056


spacer region 
between 25S and 18S 
ribosomal RNA genes 


0.62 


2661176


(U76671) putative cds
[Rhodobacter sphaeroides]


2.0 


283 


X74501


B.taurus mRNA for
ACTH receptor


0.62 


4249552 


(AB001075) galectin-2 related
protein


2.0 


284 


M57634


Rat F1-ATPase beta
subunit mRNA, 3'


0.62 


2119692


transforming growth factor-beta
type III receptor - chicken
>gi|511843 (L01121)
transforming growth factor-beta
type III receptor [Gallus gallus]


1.5 



ASPARTYL/ASPARAGINYL


285 


Y15724


Homo sapiens 
SERCA3 gene, exons 
1-7 (and joined CDS) 


0.62 


2498164 


BETA-HYDROXYLASE

(ASPARTATE BETA-
HYDROXYLASE) (ASP BETA
HYDROXYLASE) (PEPTIDE-
ASPARTATE BETA-
DIOXYGENASE) beta-
dioxygenase (EC 1.14.11.16) -
bovine >gi|162694 taurus]


0.52 


286 


AL010142 


Plasmodium 
falciparum DNA *** 
SEQUENCING IN
PROGRESS ***
from contig 3-72, 
complete sequence 


0.62


3183206 


HYPOTHETICAL PROTEIN 
KIAA0009 sapiens] 


4e-07 


287 


AB008160


Mus musculus Stat 3 
gene, 5'-flanking
region and exon 1,
partial sequence 


0.62 


466097 


HYPOTHETICAL 63.3 KD
PROTEIN ZK353.1 IN
CHROMOSOME III
>gi|1078903|pir||S44654
ZK353.1 protein -
Caenorhabditis elegans
>gi|289757 (L15313) putative
[Caenorhabditis elegans]


1e-35


288 


AB018795 


Halomonas marina 
gene for alginate 
lyase, complete cds


0.62 


3877493 


(Z.4itt8j) similar to ATPases
associated with various cellular
activities (AAA); cDNA EST
EMBL:Z14623 comes from this
gene; cDNA EST
EMBL:D75090 comes from this
gene; cDNA EST
EMBL:D72255 comes from this
gene; cDNA EST yk-?00e4... 


3e-46 


289 


Z69906


sequence from
cosmid E14iE2, on
chromosome 22,
complete sequence
[Homo sapiens]


0.61


<NONE> 


<NONE>


<NONE> 


290 


U18259


Human clone CIITA-
I MHC class II
transactivator CIITA
mRNA, complete cds.


0.61


1483567


(X79983) viral proteinase
[Pseudorabies virus]


9.8 


291 


X98890


S.tuberosum mRNA
for inorganic
phosphate
transporter, StPT1


0.61 


475724


(U0S884) protein VIII precursor
[Bovine adenovirus type 3]


7.6





292 


U70825 


Rattus norvegicus 5- 
OXO-L-prolinase 
mRNA, complete cds 


0.61 


733543 


(U23448) similar to genome 
polyprotein 

(SP:POLG_BVDVN, P19711);
alternative splicing to C04A2.7a 


4.4 


293 


L81667 


Homo sapiens 
(subclone 2_a9 from 
P1 H49) DNA
sequence 


0.61 


2565087 


(U80759) CAGH4 alternate 
open reading frame [Homo 
sapiens] 


3.3 


294 


AE000760


Aquifex aeolicus
section 92 of 109 of
the complete genome 


0.61 


2811092 


HOMEOBOX PROTEIN HOX- 
A3 (HOX-1.5) homeobox-
containing transcription factor
[Mus musculus]


2.6


295 


U58512 


Mus musculus Rho-
associated, coiled-
coil forming protein
kinase p160 ROCK-1
mRNA, complete cds 


0.61 


295671 


(L11275) selected as a weak
suppressor of a mutant of the 
subunit AC40 of DNA 
dependant RNA polymerase I 
and III 


1.5 


296 


U27459 


Human origin- 
recognition complex 
protein 2 homolog 
hORC2L mRNA, 
complete cds 


0.61 


200285 


(M97900) putative open reading 
frame [Mus musculus] 


0.66 


297 


L36680 


Pisum sativum S- 
adenosylmethionine
synthase mRNA, 3' 
end. 


0.61 


2285790 


(AB002086) p47 [Rattus 
norvegicus] 


4e-12 


298 


AE000673 


Aquifex aeolicus 
section 5 of 109 of 
the complete genome 


0.61 


3395782 


(AF058446) histone
macroH2A1.2 [Gallus gallus]


6e-27 


299 


AF086310 


Homo sapiens full 
length insert cDNA 
clone ZD51F08


0.61


3646450 


(AL031603) conserved 
hypothetical protein. 
[Schizosaccharomyces pombe] 


8e-29 


300 


AJ009675


Agrotis ipsilon
mRNA for 3-hydroxy-
3-methylglutaryl
coenzyme A
reductase


0.61 


4176370


aluuxdo; similar to calcium-
independent phospholipase A2;
similar to AC004392
(PID:g3367519) [Homo
sapiens]


2e-75 


301 


AC005577


Homo sapiens
chromosome 19,
cosmid F18382B,
centromeric end,
complete sequence
[Homo sapiens]


0.60 


<NONE> 


<NONE> 


<NONE> 
Candida albicans


302 


U40454 


topoisomerase type I 
(CATOP1) gene,
complete cds 


0.60 


<NONE> 


<NONE> 


<NONE> 


303 


J01390 


Emericella nidulans 
mtDNA between 
h2/ho and bh2/b2 
junctions, genes for 
ATPase subunit 6, 
cytochrome oxidase 
subunit 3, seven, 
unidentified proteins, 
twentyfour tRNA's 
and L-rRNA.


0.60 



<NONE> 


<NONE> 


<NONE> 


304 


L11172


Plasmodium 
falciparum RNA 
polymerase I gene, 
complete cds. 


0.60


<NONE> 


<NONE> 


<NONE> 


305 


Z31079 


Caenorhabditis 
elegans cosmid 
F39H11, complete 
sequence 
[Caenorhabditis
elegans] 


0.60 


<NONE> 


<NONE> 


<NONE> 


306 


Z49627 


S.cerevisiae 
chromosome X 
reading frame ORF 
YJR127C 


0.60 


118751 


MAJOR DNA-BINDING
PROTEIN herpesvirus 1 (strain
11) >gi|60327 (X64346) major
ssDNA-binding protein
[Saimiriine herpesvirus 2]


9.6 


307 


U94911 


Rattus norvegicus H- 
K-ATPase alpha2
gene, alternatively
spliced products and
partial cds


0.60 


2213862 


(AF003086) PfSNF2L 
[Plasmodium falciparum] 


7.4 


308 


U67476


Methanococcus
jannaschii section 18
of 150 of the
complete genome


0.60 


1749688 


(D89240) unnamed protein
product 


5.7 


309 


U67513


Methanococcus
jannaschii section 55
of 150 of the
complete genome


0.60 


3327421


(U97068) zonadhesin [Mus
musculus]


4.3 


310 


U57817


Haemophilus ducreyi
lipoprotein gene,
complete cds


0.60 


\ 

4008577


(AL034491) conserved
hypothetical protein
[Schizosaccharomyces pombe]


2.5 








probable protein-tyrosine kinase 


311 


X80700 


H. sapiens Gl7 gene 


0.60 


422541 


(EC 2.7.1.112) RTK - Pacific
electric ray >gi|290858


1.5 


312 


L42167 


Mus musculus (clone 
R24) rds gene, partial 
cds 


0.60 


4220848 


(AF033823) moira [Drosophila 
melanogaster]


0.51 


313 


U54777


Human hMSH6 
mRNA, complete cds 


0.60 


2665637 


(AF031087) mismatch repair 
protein MSH6 [Mus musculus] 


5e-07 


314 


D86985 


Human mRNA for 
KIAA0232 gene, 
complete cds 


0.60 


1938462 


(U97006) No definition line 
found [Caenorhabditis elegans]


2e-07 


315 


D43964 


Rat liver mRNA for 
Kan-1, complete cds


0.60 


1280135 


(UiD37o) coded for by C.
elegans cDNA cm21e6; coded 
for by C. elegans cDNA 
cm01e2; similar to melibiose 
carrier protein 

(thiomethylgalactoside permease
II)


5e-15 


316 


U49058 


Rattus norvegicus 
CTD-binding SR-like
protein rA4 mRNA,
partial cds


0.60 


2145091 


(U37500) RNA polymerase II 
largest subunit [Mus musculus]


1e-49


317 


X84388 


U.ruddi 

mitochondrial 12S 
ribosomal RNA 


0.60 


3874247 


(Z70205) predicted using 
Genefinder 


2e-37 


318


AF125447 


Caenorhabditis 
elegans cosmid 
Y14H12B 


0.59


<NONE> 


<NONE> 


<NONE> 


319 


U20189 


Hyoscyamus muticus 
clone cVS2 
vetispiradiene 
synthase mRNA,

Ml Utll LUa. 




<NONE>


<NONE> 


<NONE> 


320 


M63962


Human gastric H,K-
ATPase catalytic
subunit gene,
complete cds.


0.59 


<NONE> 


<NONE> 


<NONE> 


321 


AJ132366


Helicobacter pylori
(strain P1) comB and
>mi/algA (partial)
genes, and partial
ORF1 and ORF2


0.59


<NONE> 


<NONE> 


<NONE> 




Mus musculus 






322 


U17289


transcription factor 
AP-2 (AP-2) gene, 
alternative exon 1a,
and isoform 2, partial 


V.J 


&*TJ7** 1 Jf 


(AC002332) hypothetical
protein [Arabidopsis thaliana] 


9.4 


323 


Z71466 


S.cerevisiae 
chromosome XIV 

YNL190W 


0.59 


3875542 


(Z67990) Similarity to Rat 
amiloride-sensitive sodium 
channel beta-subunit 


7.3 


324 




genes for 13K, 22K 

tth/1 axiC nrnt^inc 
oflii *tOIV JJllHdllo 


ft SO 


xl 1750 f 


cry V465 protein - Bacillus 
thuringiensis thuringiensis] 


7.2


325 


L41351


Homo sapiens 
prostasin mRNA, 
complete cds 


0.59 


729212 


CRYSTALLIN J1C crystallin 
[Tripedalia cystophora]


4.2 


326 


X79854 


0.111 VUIJlCltili gvllC 

for 16S ribosomal 
RNA 


0.59 


3702828 


(AF056577) high mobility 
group protein 1.2


3.2 


327 


AJ223356 


Strongylocentrotus 
purpuratus mRNA for 
SuDp98 protein 


0.59


2495704


HYPOTHETICAL PROTEIN
KIAA0129 product is novel.
[Homo sapiens]


2.5 


328 


X86019


H.sapiens mRNA for 
PRPL-2 protein 


0.59 


1743341 


(Y10027) transcription factor
TEF-1 [Mus musculus] 


2.5 


329 


U75528 


Xiphias gladius
partial cds


0.59


1845995 


(U69477) envelope glycoprotein 
[Human immunodeficiency virus
type 1]


2.4 


330 


AC005573 


Homo sapiens 
chromosome 5, PAC
clone 202eI3 


0.59 


2506366 


DNA POLYMERASE 
EPSILON SUBUNIT B DNA- 
directed DNA polymerase (EC 
2.7.7.7) II chain B - yeast. 
(Saccharomyces cerevisiae) 
>gi|786319 (U25842) DNA 
Polymerase epsilon, subunit B 
(Swiss Prot. accession number 
P24482) [Saccharomyces
cerevisiae]


1.4 


331 


H9180 


Rat receptor-linked 
protein tyrosine 
phosphatase 


0.59 


1235974


(X96713) collagen [Globodera 
pallida] 


1.1 


332 


L32090 


Listeria 

monocytogenes secA
gene, complete cds.


0.59


2291129 


(AF016415) No definition line
found [Caenorhabditis elegans]


0.83



WO 01/02568 



PCT/US00/18374











Xenopus laevis






(AL031124) hypothetical




333 


U24433 


syndecan-2 mRNA, 
complete cds. 


0.59


3355692


protein SC1C2.25c
[Streptomyces coelicolor]


0.64 


334 


M23412 


Drosophila 
muscarinic 

acetylcholine receptor 
mRNA, complete cds 


0.59 


168237 


(M76546) hydroxyproline-rich 
protein [Helianthus annuus] 


0.22 


335 


AF060729 


Synaphea media 
chloroplast atpB-rbcL 
intergenic spacer 
region, partial 
sequence 


0.59 


731596 


kYPUimurjAL6/J>KD 

PROTEIN IN PRPS4-STE20 
INTERGENIC REGION 
>gi|626567|H|S46$25 
hypothetical protein YHL010c -
yeast (Saccharomyces
cerevisiae) >gi|2289881
(U11582) No definition line
found [Saccharomyces
cerevisiae]


0.16 


336 


AF029734 


Xanthobacter 
autotrophicus 
transcriptional 
activator AldR (aldR) 
gene, partial cds; and 
NAD-dependent 
chloroacetaldehyde 
dehydrogenase (aldB) 
gene, complete cds 


0.59


2498801 


PERIAXIN 

>cil2 14390 i lDtrllI5 8 1 57 oeriaxin 
- rat >gi|505297 (Z29649) 
periaxin [Rattus norvegicus] 


0.13 


337 


X95307 


C.reinhardtii LI8l8r- 
1 gene 


0.59 


1723781


HYPOTHETICAL 34.3 KD
PROTEIN IN TAF145-YOR1
INTERGENIC REGION
>gi|2131717|pir||S64612
hypothetical protein YGR277C -
yeast (Saccharomyces
cerevisiae)

>gi|1323505|gnl|PID|e243248
(Z73062) ORF YGR277c
[Saccharomyces cerevisiae]


1e-04


338 


M24572 


Dictyostelium
discoideum tRNA-
Glu-GAA gene, clone 
yGluGAAS, 


0.59 


1176186 


HYPOTHETICAL 43.3 KD 
GTP-BINDING PROTEIN IN 
DACB-RPMA INTERGENIC 
REGION >gi|606121 coli]


3e-06 


339 


U73733


Human hMSH6 gene,
exon 2


0.59


2665637


(AF031087) mismatch repair
protein MSH6 [Mus musculus]


5e-07



Escherichia coli


340 


D90747


genomic DNA. (25.2 -
25.6 min)


U.J7 


134286


DOLICHOL KINASE


6e-08


341 


J05211


Human desmoplakin
mRNA, 3' end.


0.59 


246796


major centromere protein,
CENP-B [human, Peptide, 594
aa]


4e-08


342


L24441


Loligo pealii kinesin
light chain mRNA,
complete cds




547800


KINESIN LIGHT CHAIN
(KLC) sea urchin
(Strongylocentrotus purpuratus)
>gi|161530


5e-14


343 


M25140


Human cardiac alpha-
myosin heavy chain
(MYH6) gene, exons
2, 3 and 4




<NONE>


<NONE>


<NONE>


344 


L81932 


Homo sapiens 
(subclone 9_h2 from 
P1 H21) DNA
sequence 


0.58 


<NONE>


<NONE>


<NONE>


345 


AF087966 


length insert cDNA 
clone YU51G04 


0.58 


<NONE> 


<NONE> 


<NONE>


346 


Z78574 


H.sapiens flow-sorted
chromosome 6 TaqI 
fragment, 
SC6pA10GH 


0.58 


<NONE> 


<NONE> 




347


AF068061 


Blattella germanica 
allatostatin 
neuropeptide 
precursor, gene, 
complete cds 


0.58


<NONE>


<NONE>


<NONE>
<NONE>


348


AF015592 


Homo sapiens Cdc7 
(CDC7) mRNA 

* f IfUVl 1^ 

complete cds


0.58


<NONE>


<NONE>


<NONE>


349


AF028006


Methanosarcina
barkeri atp operon:
ATP synthase beta
subunit (atpD), ATP
synthase epsilon
subunit (atpC), ATP
synthase gene 1
(atpI), ATP synthase
subunit subunit (...


0.58


3184291


(AC004136) putative DNA
polymerase III gamma subunit




350


AB017032


Mus musculus gene
for pancreatic trypsin,
complete cds


0.58


3170561


(AF056704) synapsin IIIa
[Rattus norvegicus]


9.4
9.2






Dictyostelium 










351 


AF081585 


discoideum 
developmental 
protein DG1I10 
(DGU 10) gene, 
partial cds 


0.58 


105417 


basic proline-rich peptide IB-8a 
human 


9.2 


352


AF086322


Homo sapiens full
length insert cDNA
clone ZD53E01


0.58


93026 


hypothetical protein - African 
swine fever virus (strain Malawi 
Lil 20/1) >gi|450758 (X71982)
myeloid differentiation antigen 
homologue [African swine fever 
virus] >gi|903686 (M95672) 
unknown protein 


7.1 


353 


AF088025 


Homo sapiens full 
length insert cDNA 
clone ZC19C04 


0.58 


2384644 


(U92805) thrombospondin-3 
[Xenopus laevis]


7.0 


354 


AB002339 


Human mRNA for 
KIAA0341 gene, 
partial cds 


0.58


2135587


M130 antigen (cytosolic variant
2) - human


5.4


355 


U67548 


Methanococcus
jannaschii section 90
of 150 of the
complete genome 


0.58 


2911094 


(AL021957) hypothetical
protein Rv2174


4.2


356 


L07868 


Homo sapiens 
receptor tyrosine 
kinase (ERBB4) 
gene, complete cds. 


0.58


461922


PYRUVATE

DECARBOXYLASE (8-10 NM
CYTOPLASMIC FILAMENT-
ASSOCIATED PROTEIN)
(P59NC) 4.1.1.1) - Neurospora
crassa >gi|293948 (L09125)
pyruvate decarboxylase
[Neurospora crassa]
>gi|1655909,


4.2 


357 


X03897 


Bacillus subtilis 
sigma 43 operon with 
P23-dnaE-rpoD genes 
(dnaE for DNA 
primase, rpoD for
RNA polymerase) 


0.58 


1323704 


(U55387) similar to C. elegans
F38E1.9 gene product encoded
by GenBank Accession Number 
U41996 [Cricetulus griseus] 


4.1 


358 


D76419 


Desulfovibrio 
vulgaris rbo gene for 
desulfoferrodoxin and 
rub gene for 
rubredoxin, complete 
cds 


0.58 


3420047 


(AC004680) putative protein
kinase [Arabidopsis thaliana]


2.4 






Human DNA 










359 


Z82174 


sequence from 
cosmid B20F6 on 
chromosome 22, 
complete sequence 
[Homo sapiens]


0.58 


2145455 


(Y07866) catalase peroxidase


2.4 


360 


M33642 


F.solani STI35 
protein gene, 
complete cds


0.58 


2896706 

' yjyj 


(AL021897) hypothetical

nrntpin RvlOriQr 

LFI UICII1 IV V 1 VU7v 




361 


U64873 


Mus musculus 
transforming growth 
factor alpha (TGF 

alpha) gene, partial

cds 


0.58 


3874437 


(Z81038) predicted using 
Genefinder; cDNA EST

[Caenorhabditis elegans]


1.8


362 


AB002132 


Macrophthalmus
banzai mitochondrial
DNA for 12S and
16S rRNA, partial
and complete 
sequence 


0.58 


2960022 


(AJ224676) rho type GEF
[Drosophila melanogaster] 


1.8 


363 


AF070070 


Caenorhabditis 
elegans MutS 
homolog (msh-5) 
mRNA, partial cds


0.58


4098205


(U75869) Omp22 [Helicobacter
pylori] 


1.8 


364 


AF045240 


Staphylococcus 
epidermidis plasmid 
pIP1629 mobilization 
protein (mobC I), 
(orf69-l), (mobAl), 


0.58 


42I8U7 


(AL035353) protein (fragment) 


0.62 


365 


X61637 


H.sapiens Wilms
tumor gene 1, exons 8
and 9 


0.58 


2331059 


(U8821 H unknown TGallus 

^ yUUw A X f L*IA*V4 Jw W 11 [Willi III 

gallus] 


0.62 


366 


AF039312 


Moraxella catarrhalis
strain 4223 transferrin 
binding protein A 
(tbpA) and transferrin 
binding protein B 
(tbpB) genes, 
complete cds; and 
unknown gene 


0.58 


120155 


FIBER PROTEIN 
>gi|74229|pir||ERADFM fiber
protein - mouse adenovirus 1
>gi|209758 (M30594) fiber
protein [Mastadenovirus mus1]


0.27 


367 


D87463 


Human mRNA for 
KIAA0273 gene, 
complete cds 


0.58 


3861477 


(U94177) androgen receptor
[Pan troglodytes] 


0.12 


368 


U40342 


Mus musculus ninein 
mRNA, complete cds.


0.58 


4115936 


(AF118223) No definition line
found [Arabidopsis thaliana]


0.004


369 


S57235


transmembrane
glycoprotein [human,
promonocyte cell line
U937, mRNA, 1722
nt]


0.58


2072501


(U96113) WWP1 [Homo
sapiens]


1e-04


370 


U39391 


Mus musculus 
serotonin 1A receptor
mRNA, complete cds. 


0.58 


1469876 


(D63481) The KIAA0147 gene 
product is related to adenylyl 
cyclase. [Homo sapiens] 


1e-07


371 


D00056 


Monkey B- 
lymphotropic 
papovavirus genes for
VP-1,2,3 and large
T antigen, complete
and partial cds, strain
LPV-76 > ::
gb|M14494|PPMVP1
M Monkey B
lymphotropic
papovavirus mutant
(LPV-76) PstI B
fragment encoding
VP1, VP2, VP3 and
T-antigen.


0.58




(AJ001774) vanadium
chloroperoxidase


1e-08


372 


M77182 


Amsacta 
entomopoxvirus
spheroidin gene,
complete cds, and 
four vaccinia related 
orfs. > :: 

gb|I16670|I16670 
Sequence 1 from 
patent US 5476781 


0,58 


1730722 


HVPUlHhllLAL4i.li KU~ 
PROTEIN IN NCE3-HHT2 
INTERGENIC REGION 
>gi|213i871|pir||S62957 
hypothetical protein YNL035c - 
yeast (Saccharomyces 
cerevisiae)

>gi|1301880|gnl|PID|e239670
(Z71311) ORF YNL035c
[Saccharomyces cerevisiae]


8e-14 


373 


S72579


gloo-S=growth-
associated protein
GAP-43 homolog


0.58 


2689720


(AF037168) DnaJ homologue
[Arabidopsis thaliana]


7e-14 


374 


AF018165


Tetraodon fluviatilis
amyloid precursor
protein mRNA,
complete cds


0.58 


3219938


HYPOTHETICAL 34.5 KD
PROTEIN C57A10.UC IN

CHROMOSOME I

>gi|2058378|gnl|PID|e314002
pombe]


5e-22 



"iS8 














375 


U81803 


Filobasidiella
neoformans
translation elongation
factor EF1-alpha
(CnTEF1) mRNA,
complete cds 


0.57 


<NONE> 


<NONE> 


<NONE> 


376 


U09781 


Candida albicans 
ATCC 18804, CBS 
562 peptide 
transporter gene, 
complete cds. 


0.57 


<NONE> 


<NONE> 


<NONE> 


377 




Homo sapiens 
(subclone 4_b10 from
BAC H102) DNA
sequence 


0.57 


<NONE> 


<NONE> 


<NONE> 


378 


U23442 


Tetrahymena 
thermophilaRR 
internal deletion 
sequence. 


0.57 


<NONE> 


<NONE> 


<NONE> 


379 


U17289 


Mus musculus 
transcription factor 
AP-2 (AP-2) gene, 
alternative exon 1a,
and isoform 2, partial 
cds. 


0.57 


<NONE> 


<NONE> 


<NONE>


380 


X70844 


Buzura suppressaria 
nuclear polyhedrosis
virus gene for 
polyhedrin protein 


0.57 


<NONE> 


<NONE> 


<NONE> 


381 


AJ012159 


Homo sapiens 5T4 
oncofetal trophoblast 
glycoprotein gene


0.57 


<NONE> 


<NONE> 


<NONE> 


382 


X76571 


H.sapiens simple
DNA sequence region
clone wglaS,


0.57 


<NONE> 


<NONE> 


<NONE> 



Vibrio cholerae

pathogenicity island,
putative transposase,
aldehyde
dehydrogenase
(aldA), toxR-
activated gene A
protein (tagA),
putative inner
membrane protein,
and putative zinc
metalloprotease
genes, complete cds;
383 | AF034434 | and...



384 | AB017031



Mus musculus gene
for TESP4, complete

cds



0.57 



<NONE> 



<NONE> 



<NONE> 



385 | X89788 



S.hispidus
mitochondrial DNA
for SSU ribosomal 
RNA gene 



0.57 



<NONE>



<NONE>



<NONE>



386 



Rat progesterone
receptor gene, 5'
L16921 | untranslated region.



0.57 



<NONE> 



<NONE> 



<NONE>



0.57 



3323116 



(AE001251) femA protein,
putative [Treponema pallidum]



Homo sapiens
interferon regulatory

387 | AF027292 | factor 6 | 0.57

Cicer arietinum
mRNA for

388 | AJ012581 | cytochrome P450



259790 



l<^48157) DNA polymerase-
primase 180 kDa subunit
[Drosophila melanogaster,
Peptide, 1490 aa]



8.9 



6.7 



0.57 



2131498 



hypothetical protein YDR446w -
yeast C AI: 0.11 [Saccharomyces
cerevisiae] | 5.3



Human transfer RNA-
Met (TRMEPI)
pseudogene, complete
389 | L15363 | gene | 0.57



Helicobacter pylori
26695 section 3 of
134 of the complete
390 | AE000525 | genome | 0.57



3228680 



(AF070935) GABA receptor
subunit [Musca domestica]



5.2 



(U97008) weak similarity to
family I of G-protein coupled
receptors [Caenorhabditis
1938478 | elegans] | 4.0



391 



Amblyomma
americanum
ecdysteroid receptor
(AamEcR) mRNA,
AF020189 | 3'UTR region



(U94875) p40 [Borna disease
0.57 | 2072224 | virus]



4.0 


392 


X56997


coding for ubiquitin-
52 amino acid fusion
protein


0.57


2960113


(AL022121) hypothetical
protein Rv3689


4.0


393


AL010260


Plasmodium 
falciparum DNA *** 
SEQUENCING IN 
PROGRESS *** 
from contig 4-81, 
complete sequence 


0.57 


117233 


(CYPHC14) phenobarbital- 
inducible, hepatic - rabbit P-450
[Oryctolagus cuniculus]
>gip58265jprf||1306317A 
cytochrome P450 [Oryctolagus 
cuniculus] 


3.9


394 


M99581 


Xenopus laevis 
gamma-crystallin 
(gcry3) gene, 
complete cds. 


0.57 


141647 


GASTRULA ZINC FINGER
PROTEIN XLCGF44.2 
>gi|85736|pir||S06571 finger 
protein (clone XlcGF44-2) - 
African clawed frog (fragment) 


3.0


395 


M38384


Drosophila
melanogaster seven in
absentia mRNA,
complete cds.


0.57


1707127


(U80454) T16AL1
[Caenorhabditis elegans]


3.0


396


U32795 


Haemophilus 
influenzae Rd section 
110 of 163 of the
complete genome


0.57


IRON(III)-TRANSPORT
SYSTEM PERMEASE
PROTEIN SFUB >gi|152861
1173433 | (M33815) protein (sufB)


2.3


397


X12600


Klebsiella 
pneumoniae nifX, 
nifU, nifS, nifV and
nifW genes


0.57


(AL021925) hypothetical
2909562 | protein Rv2256c


1.4


398


AB014526


Homo sapiens mRNA
for KIAA0626
protein, complete cds


0.57


insect-stage-specific protein -
Trypanosoma cruzi >gi|162099
(M65021) insect stage-specific
482390 | antigen


0.61


399


AF063587


Rhodococcus fascians
strain NRRL-B-
15096 hypothetical
protein gene,
complete cds


0.57


4104321


(AF0345S2) vesicle associated
protein [Rattus norvegicus]


0.46


400


L1H17


Guinea pig estrone
sulfotransferase gene.


0.57


82584


alpha/beta-gliadin precursor
(clone A212) - wheat


0.35


401 


V00829


Mouse COmpIeteTJeTT 
for a mouse kallikrei 
gene. Genes ait mGI 
1 (complete gene) 
and mGK-2 of 
hormones, e.g., 
grow... > 

gb|J0039O(MUSKAL 
07 Mouse pseudo- 
kallikrein 2, exons 4 
and 5, and kallikrein 
1 gene, complete cds. 


c 

i 
C 

0.57 


2500916


NUCLEAR HORMONE
RECEPTOR NOR-2 receptor
[Rattus norvegicus]
>gi|1583604|prf||2121281A
NOR-2 protein [Rattus
norvegicus]


0.20


402 


X53092


Chicken mRNA for
beta-2 subunit of
neuronal nicotinic 
acetylcholine receptor 


V.J / 




(U40953) similar to matrin F/G 
(SP:Q00910) containing C4- 
type zinc-fingers 
[Caenorhabditis elegans]


0.031


403


L07939 


Ovis ovis granulocyte 
colony stimulating 
factor 


0.57


3874345 


{Zh ivii) predicted using 
Genefinder; Similarity to 
dehydrogenases; cDNA EST
EMBL:D65800 comes from this
gene; cDNA EST
EMBL:D76184 comes from this
gene; cDNA EST
EMBL:D69322 comes from this
gene; cDNA EST
EMBL:C08158 comes f...


3e-07


404


U18061


Colletotrichum
gloeosporioides
CAP20 (cap20) gene,
complete cds.


0.57


2914695


(AC003974) putative ubiquitin
specific protease


9e-08


405


Z73955


L.japonicus mRNA
for small GTP-
binding protein,
RAB11G


0.57


112894


TUMOR NECROSIS FACTOR,
ALPHA-INDUCED PROTEIN
(PUTATIVE DNA BINDING
PROTEIN A20) (ZINC
FINGER PROTEIN A20)
>gi|107549|pir||A35797
probable DNA-binding protein
A20 - human >gi|177866
(M59465) A20


7e-08



















406 


X04335 


Petunia grp- 1 gene 
for glycine-rich 
protein 


0.57


3876901 


(CitbW) Similarity to Human 
enoyl-CoA hydratase 
(SW:ECHM_HUMAN); cDNA
EST EMBL:T00611 comes
from this gene; cDNA EST
yk203d10.3 comes from this
gene; cDNA EST yk203d10.5
comes from this gene; cDNA
EST yk457h5.3 comes from t..


1e-27


407 


U40718 


Rattus norvegicus S- 
adenosyl methionine 
decarboxylase 
(AMDP2) 
pseudogene 


0.56 


<NONE> 


<NONE> 


<NONE> 


408 


M60318 


S.cerevisiae SSD1
protein gene,
complete cds. > ::
gb|AR013983|AR013
983 Sequence 8 from
patent US 5773245 


0.56 


<NONE> 


<NONE> 


<NONE> 


409 


X60057 


Nicotiana tabacum 
blp4 mRNA for 
luminal binding 
protein (BiP) 


0.56 


<NONE>


<NONE> 


<NONE> 


410 


AF085930 


Homo sapiens full 
length insert cDNA 
clone YR55A09 


0.56 


<NONE> 


<NONE> 


<NONE> 


411 


AL010189 


Plasmodium 
falciparum DNA *** 
SEQUENCING IN 
PROGRESS *** 
from contig 3-102, 
complete sequence 


0.56


<NONE>


<NONE>


<NONE> 




X05402 


Murine G-CSF gene 
for granulocyte 
colony stimulating 
factor precursor 


0.56 


<NONE> 


<NONE> 


<NONE> 


413 


U92280 


Rattus norvegicus 
regulator of G-protein 
signalling 12 
(RGS12) mRNA, 
complete cds 


0.56 


<NONE> 


<NONE> 


<NONE> 


414 


U85660


Human 

papillomavirus strain 
RTRX7 complete 
genome


0.56 


<NONE> 


<NONE> 


<NONE> 






WO 01/02568 



PCT/US00/18374





415 


X57626 


M. javanica 
mitochondrion 
ATPase 6, and 
putative tRNA-f-Met
and tRNA-His genes


0.56


<NONE> 


<NONE> 


<NONE> 


416 


AB003363 


Sus scrofa S100C
gene, complete cds 


0.56 


<NONE> 




<NONE>


417 


L42291 


Danio rerio DANA 
element, intron 4. 


0.56 


2650002 


(AE001062) conserved 
hypothetical protein 


9 7 
O./ 


418 


AF031826 


Mus musculus 
leukocystatin gene, 
complete cds 


0.56 


462493


L-LACTATE

DEHYDROGENASE
(IMMUNOGENIC PROTEIN
P36) >gi|479296|pir||S33362 L-
lactate dehydrogenase (EC
1.1.1.27) - Mycoplasma
hyopneumoniae


6.7


419 


U17068 


Pennisetum glaucum 
Ac-like element,
AcL2.


0.56


399449


ESCARGOT/SNAIL PROTEIN
HOMOLOG


6.7


420 


Z48042


H. sapiens mRNA
encoding GPI-
anchored protein
p137


0.56


141232


HYPOTHETICAL 8.7 KD
PROTEIN (READING FRAME
D) >gi|76316|pir||QQSA7C
hypothetical protein E-74 


6.7 


421 


AF027657 


Choristoneura
fumiferana
entomopoxvirus
nucleotide
triphosphate
phosphohydrolase I
(NPHI) gene,
complete cds


0.56


464999


PUTATIVE
ACETYLCHOLINE
REGULATOR UNC-18
>gi|480359|pir||S36747
acetylcholine regulator unc-18 -
Caenorhabditis elegans
>gi|247392|bbs|100294 putative
acetylcholine regulator unc-18


5.1 


422 


AB011540


Homo sapiens mRNA
for MEGF7, partial
cds


0.56


1718033


URACIL-DNA
GLYCOSYLASE (UDG)
herpes virus 2 >gi|695219
(U20824) uracil DNA
glycosylase


5.1


423 


X59941


C.maculatus NGF
gene for nerve growth
factor


0.56


1169081


COMMON PLANT

REGULATORY FACTOR
CPRF-1 >gi|515621 (X58575)
light-inducible protein CPRF-1
[Petroselinum crispum]
>gi|1498301 (U46217) CPRF1


3.8







424 | M72711



Rat transcriptional
repressor of myelin-
specific genes (SCIP)
mRNA, complete cds.



0.56



50J027 



425 | AL023850



426 | U47862



427 | V00574



428 | Z71502



Caenorhabditis
elegans cosmid
Y67D11A, complete
sequence
[Caenorhabditis
elegans]



0.56



266771 



Schistosoma mansoni
gynecophoral canal 
protein mRNA, 
complete cds 



0.56 



2147138 



Human germ line 
gene homologous to 
bladder carcinoma 
oncogene T24 (Gene 
code c-Ha-ras-1) with
four exons. 



429 | M37278



430 | D28878



431 | Z15027



432 | M14362



433 | Z50801



XJacvisHl(0M 
gene 



0.56 



1518672 



(U01849) ORF2 [Trypanosoma
brucei]

CHORISMATE MUTASE



2.3



(CM) / PREPHENATE
DEHYDRATASE (PDT) (P-
PROTEIN)

>gi|281791|pir||S26053
chorismate mutase (EC 5.4.99.5)
/ prephenate dehydratase (EC

4.2.1.51) - Erwinia herbicola
>gi|43344 



2.3 



ATP synthase chain 6 - 
Platymonas subcordiformis 
mitochondrion >gi|633582
(Z47797) ATP synthase subunit
[Platymonas subcordiformis]



23 



(U60289) receptor protein 
tyrosine phosphatase psi [Homo | 
sapiens] 



0.56 



1651674 



R.norvegicus renin 
gene, exons 1-9. 



0.56 



2853019 



(D90899) ferrichrome-iron 
receptor 



1.7 



Thermus 

thermophilus polA
gene for thermostable | 
DNA polymerase I, 
complete cds 



(AF045141) putative serine 
proteinase [Scirpophaga 
incertulas] 



1.3 



H.sapiens HLA class
III DNA



0.56



3659692 



0.56 



1304141 



Human T-cell surface
antigen CD2 (T11)
mRNA, complete cds.



0.56



2462979 



Z.mays mRNA for
chlorophyll a/b-
binding protein CP29



0.56 



109677 



(AF068748) sphingosine kinase
[Mus musculus] 



0.77 



(D43758) fibrinogen A-alpha- 
chain 



(Y11915) Tenascin-X [Bos
taurus] 



collagen alpha 1(I) chain -
mouse >gi|50487



0.76 



0.59 



0.50 






435 | AF052254 



S.cerevisiae 
chromosome XIII 
cosmid 9745 | 0.56

Escherichia coli DNA
gyrase A (gyrA) gene,
partial cds | 0.56

Tegula pulligo 12S



436 | AF080649



437 | AJ005690



438 | U31202



small subunit 
ribosomal RNA gene,
mitochondrial gene
for mitochondrial
RNA, partial
sequence



Danio rerio mRNA
for protein tyrosine 
kinase 



0.56 



439 | X51695



440 | U28938



Human noggin 
(NOGGIN) gene, 
complete cds. 



0.56 



Ovis sp. trichohyalin
mRNA, partial



0.56



Rattus norvegicus
protein tyrosine
phosphatase D30
mRNA, complete cds



0.56 



441 | AE001171



442 | AF036685



443 | X01173



444 | D10911



Borrelia burgdorferi 
(section 57 of 70) of 
the complete genome 



0.56 



Caenorhabditis
elegans cosmid
C05B10



0.56 



Xenopus laevis 
vitellogenin gene A1
5' flanking region



0.56 



Mus musculus DNA 
for MS2 protein,
complete cds



0.55



140372 



INTERGENIC REGION 
>gi|83159|pir||S19367
hypothetical protein YCL039w -
yeast (Saccharomyces

cerevisiae) | 0.35



2724126 



(AF038535) synaptotagmin VII
[Homo sapiens] | 0.12



CYCLIN-DEPENDENT
KINASE INHIBITOR 1
3913223 | p21/WAF1 [Felis catus]



(AF030962) unknown 
2623830 [Schistosoma mansoni] 



(Z784U)F02D8.3 
0.56 | 3875475 | [Caenorhabditis elegans]



(AC004665) unknown protein
3386622 | [Arabidopsis thaliana]



3293547 



(AF072709) putative 
oxidoreductase [Streptomyces 
lividans]



2315521 



(AF016452) similar to the beta
transducin family 



1519671 



1118102 



(U41558) K02B2.3 gene 
product [Caenorhabditis 
elegans] 



<NONE> 



<NONE> 



0.11



7e-06



1e-06



1e-10



1e-14



4e-16



(U67951) contains similarity to
ATP/GTP-binding site motif A
(PS:PS00017) [Caenorhabditis
elegans] | ^20



2e-31



445 | D3QQlf) 



446 | U51991

447 | M18858



Rice mRNA EN117,
partial sequence



0.55 



448 | U95218 



449 | M14948 



450 1 AB002353 



451 | L81689



Escherichia coli 
phosphoprotein 

Mouse T cell receptor
C-gamma-7.1 mRNA,

3' end.

Homo sapiens T cell- 
death associated 
protein gene, 
complete cds 



0.55 



0.55 



0.55 



Human R-ras gene,
exon 1. 



Human mRNA for 
KIAA0355 gene, 
complete cds 



0.55 



452 | M68955



453 | X62953



454 | L34630



455 | U43521



Homo sapiens 
(subclone 1_d6 from
P1 H54) DNA
sequence



0.55



0.55



Human myristoylated
alanine-rich C-kinase
substrate (MACS)

5' end. | 0.55

R.norvegicus mRNA
(pJGM6) with

repetitive elements | 0.55
Synechocystis sp.



mntABC transporter 
system: periplasmic- 
binding protein 

mntC, complete cds;
(mntA) gene, 
complete cds; 
membrane protein 
(mntB) gene, 
complete cds. 



Plasmodium berghei 
merozoite surface
protein-1 gene,
complete cds



0.55



0.55






<NONE> 



<NONE> 






<NONE> 



<NONE> 



<NONE> 
<NONE> 



<NONE> 



<NONE> 



<NONE> 



<NONE> 



1076802 



2117632 



<NONE> 



127654 



<NONE> 



<NONE> 



<NONE>



<NONE>



<NONE>



<NONE> 



<NONE> 



(AE001220) V-type ATPase,
subunit B (atpB-1) [Treponema
3322710 | pallidum] | 5.0



extensin-like protein - maize
>gi|600118 mays] | 5.0



hydrogen dehydrogenase (EC 
1.12.1.2) - Clostridium
acetobutylicum >gi|557064
(U15277) hydrogenase I
[Clostridium acetobutylicum] 



5.0 



MYOGLOBIN 



4.9 





456 


Z64937


H.sapiens CpG DNA,
clone 17g7, reverse
read cpg17g7.rt1a.


0.55 


417298 


MFS18 PROTEIN
PRECURSOR 


3.8 


457 


U10914


Macaca mulatta clone
lriioj T-cell receptor
alpha chain mRNA, 
partial cds. 


0.55 


310406 


(L09212) tat protein [Simian 
immunodeficiency virus]


3.8 


458 


AF022838 


Homo sapiens 
multidrug resistance 
protein 


0.55 


1585251 


traB gene [Amycolatopsis 
methanolica]


2.8 


459 


M35603 


Mouse Hox-3.1 gene 
and Hox-3.2-Hox-3.1
intergenic region. 


0.55 


818849 


(U25430) nucleotide 
pyrophosphatase precursor 
[Oryza sativa]


2.0 


460 


AE001395 


Plasmodium 
falciparum 
chromosome 2, 
section 32 of 73 of 
the complete 
sequence 


0.55 


137532 


PROTEIN C2 

>gi|74386|pir||WZVZB6 59K 
HindIII-C protein - vaccinia
virus (strain WR) 


1.7 


461 


AE001395 


Plasmodium

falciparum 
chromosome 2, 
section 32 of 73 of 
the complete 
sequence 


0.55 


137532 


PROTEIN C2 

>gi|74386|pir||WZVZB6 59K
HindIII-C protein - vaccinia
virus (strain WR) 


1.7 


462 


U59736 


Human transcription 
factor (NFATc.b) 
mRNA, complete cds 


0.55 


3327144 


(AB014565) KIAA0665 protein
[Homo sapiens]


0.096 


463 


U34860


Saccharomyces
cerevisiae origin
recognition complex
large subunit (ORC1)
gene, complete cds


0.55 


140372 


HVPOTritl'ICAL 86TTKB 

PROTEIN IN GLK1-SRO9
INTERGENIC REGION

>gi|83159|pir||S19367

hypothetical protein YCL039w -
yeast (Saccharomyces
cerevisiae)


0017 


464 


AF012341


Homo sapiens
glutaryl-CoA
dehydrogenase
(GCDH) gene, exons
6, 7, 8, 9, and 10


0.55


1166611


(U44674) coded for by C.
elegans cDNA yk27d9.5; coded
for by C. elegans cDNA
yk27d9.3; short region of weak
homology to drosophilia
suppressor of sable protein


O.OOS 





465 


AF004891 


HIV-1 isolate Q98-
CxA from Kenya, 
envelope 

glycoprotein C2V3 
region (env) gene, 
partial cds 


0.54 


<NONE> 


<NONE> 


<NONE> 


466 


Y10159


D.discoideum
racGAP gene


0.54


<NONE> 


<NONE> 


<NONE> 


467 


AB001895 


Homo sapiens mRNA 
for B120, complete
cds 


0.54 


<NONE> 


<NONE> 


<NONE> 


468 


X12357


Bovine gene for
aspartyl protease
NM1 exons 3 and 4 >
lcl|X12357 Bovine
aspartyl protease
NM1 gene, exons 3
and 4.


0.54


<NONE> 


<NONE> 


<NONE> 


469 


AE001151


Borrelia burgdorferi 
(section 37 of 70) of 
the complete genome 


0.54 


<NONE> 


<NONE> 


<NONE> 


470 


X92052 


H.sapiens mRNA for 
T cell receptor alpha 
chain 


0.54 


<NONE> 


<NONE> 


<NONE> 


471 


U00938


Mus musculus ileal
lipid-binding protein
gene, complete cds 


0.54 


1009712 


(U27698) calreticulin 
[Arabidopsis thaliana] 


4.9


472 


X68367 


M.thermoformicicum
complete plasmid
pFZ1 DNA


0.54 


P5272 


casein kinase II, alpha
chain (CKII)

>gi|419938|pir||A43297 casein
kinase II (EC 2.7.1.-) alpha
chain - Theileria parva
>gi|161871 (M92084) casein
kinase II alpha subunit 
[Theileria parva] 


4.7 


473 


Z61098


H.sapiens CpG DNA,
clone 44c4, reverse
read cpg44c4.rt1a.


0.54


4191274


(AJ131094) Xvent-1B protein
[Xenopus laevis]


37 


474 


M63962


Human gastric H,K-
ATPase catalytic
subunit gene,
complete cds.


0.54


3881648


(Z70757) similar to serine
protease inhibitor
[Caenorhabditis elegans]


37 


475 


X86019


H.sapiens mRNA for
PRPL-2 protein


0.54


1648828


(D87963) ETF-related factor-1
(ETFR-1)


2.1 



S.glaucescens genes 






strU, strX, strV and
strW for 5'-
hydroxystreptomycin
production and
transport
476 | X89010 | polypeptides



0.54 



3550345 



(AF084524) cellular repressor
of E1A-stimulated genes CREG
[Mus musculus]



0.25 



Homo sapiens mRNA
477 | AB007836 | for Hic-5, partial cds | 0.54



ORF 1 [Streptomyces
1097213 | lavendulae]



478 



0.15 



Comamonas
testosteroni TsaR
(tsaR),

toluenesulfonate
methyl-

monooxygenase
oxygenase component
component (tsaB),
toluenesulfonate zinc-
independent alcohol
U32622 | dehydrogenase... | 0.54



(Z96047) DY3.6
387535 1 [Caenorhabditis elegans]



0.006 



479 | D61394



Arabidopsis thaliana
gene for beta-VPE,
complete cds



0.53 



<NONE> 



<NONE> 



<NONE> 



480 | D61394 



Arabidopsis thaliana 
gene for beta-VPE, 
complete cds



481 | Z33072



M.capricolum DNA
for CONTIG MC097



0.53 



<NONE> 



<NONE> 



<NONE> 



0.53 



<NONE> 



<NONE>



<NONE> 



482 | U45975 



Human 

phosphatidylinositol
(4,5)bisphosphate 5-
phosphatase homolog
mRNA, partial cds.
S.cerevisiae



483 | Z71324



chromosome XIV
reading frame ORF
YNL048w



0.53 



<NONE> 



<NONE> 



<NONE> 



484 | L32090



Listeria

monocytogenes secA
gene, complete cds.



M130 antigen (cytosolic variant
0.53 | 2135586 | - human | 2.1



0.53 



229 M 29 



(AF016415) No definition line 
found [Caenorhabditis elegans]



070 



485


D86423


Mus musculus mRNA
for HGT keratin,
partial cds


0.53




(X96713) collagen [Globodera 


0.41 


486 


Y15969


kappa 21-6 gene,
partial


0.52


<NONE> 


<NONE> 


<NONE> 


487 


M27480 


Mus musculus (clone 
3F9) transcribed 
germline T cell 
receptor gamma chain 
(Tcr-g) mRNA, VJ4 
G4 region. 


0.52




(Z67990) Similarity to Rat 
amiloride-sensitive sodium 


4.6 


488 


D87004 


Human (lambda) 
DNA for 

immunoglobulin light
chain 






(U37272) winged helix protein 
CWH-1 [Gallus gallus] 


3.5


489 


Z99704


Human DNA 
sequence from 
cosmid E75B8 on 
chromosome 22, 

complete sequence
[Homo sapiens] 


0.51 


<NONE> 


<NONE> 


<NONE> 


490 


U76523 


Sambucus nigra lectin 
precursor mRNA, 
complete cds 


0.51 


<NONE> 


<NONE> 


<NONE> 


491 


U32795 


Haemophilus 
influenzae Rd section 
110 of 163 of the 
complete genome 


0.50






<NONE> 


492 


M14602 


Human myoglobin 
gene, exon 2.


0.49 


478384 


helicase homolog g10L protein -
African swine fever virus
>gi|414091 (X72951) G10L 125
KDa protein


7.0


493 


D87075


Human mRNA for
KIAA0238 gene,
partial cds


0.24


1938429


(U97602) similar to
Schizosaccharomyces pombe 4-
nitrophenylphosphatase
(PNPPASE) (SP:Q00472,
NID:g5004) [Caenorhabditis
elegans]


2.5


494 


U95102


Xenopus laevis
mitotic

phosphoprotein 90
mRNA, complete cds


0.23 


<NONE> 


<NONE> 


<NONE> 






496 | X16399

497 | AE001251



N.crassa
mitochondrial small
(19S) rRNA and Cys

tRNA.

Gene for glutamate



0.23 



192150 



(L05670) clustrin [Mus
musculus]



dehydrogenase (EC
1.4.1.4), put. bacterial

origin | 0.23

Treponema pallidum
section 67 of 87 of

the complete genome | 0.22



(L07867) invariant surface
glycoprotein [Trypanosoma
_2?0933 | brucei]



<NONE> <NONE>



5.1 



0.030 



<NONE>



498 | AF026919


Homo sapiens
amyloid lambda light
chain variable region
mRNA, partial cds


0.21


<NONE>


<NONE>




499 | Z27247


D.melanogaster 
mRNA for defensin 


0.21 


<NONE>


<NONE>


<NONE> 


500 | Y15608


Candida albicans
UBI3 gene


0.21


<NONE>


<NONE>


<NONE>


501 | V0059*


Human beta-tubulin
pseudogene.


0.21 


<NONE>


<NONE> 




502 | X79426


A.thaliana
microsatellite
[repeated motif
(gai)7]


0.21


<NONE> 


<NONE> 


<NONE> 
|<NONE> 


503 | X75772


A.caerulescens
mitochondrial genes
for cytochrome b and
NADH

dehydrogenase 5


0.21


139626 


PROTEIN Tl PRECURSOR 


7,S 1 


504 | AF028736


Serratia marcescens
site specific
recombinase


0.21


3645960


(AL031583) 1-
evidence=predicted by content;
1-method=genefinder;084; 1-
method_score=47.46; 1-
evidence_end; 2-
evidence=predicted by match; 2-
match_accession=SWISS-
PROT:P23792; 2-
match_description=DISCONNE
CTED PROTEIN.; 2-matc...


4.6 


505 | X97545


S.cerevisiae OST5
gene


0.21


(AF014940) No definition line
2275631 | found [Caenorhabditis elegans]


7.7 



















506 


M24543 


Human prostate- 
specific antigen (PA) 
gene, complete cds. 


0.21 


1938527 


(U97012) C04E6.2 gene 
product [Caenorhabditis 
elegans]


2.7 


507 


M62470 


Mouse 

thrombospondin 
(THBS1) gene,
complete cds. 


0.21 


548563 


POLYPROTEIN 2.7.7.48)
Erysimum latent virus 
>gi|3892232 (AF098523) 
replicase protein [Erysimum 
latent virus] 


2.1 


508 


Y13544 


Homo sapiens cosmid 
CI 


0.21 


1235710 


(L40584) polyprotein 
[Infectious pancreatic necrosis 
virus]


9 ft 


509 


M24193 


Chicken MHC B 
complex protein (C12 
3) mRNA, complete 
cds. 


0.21


3600102 


(AF090441) extracellular reelin 
[Gallus gallus] 


0.52 


510 


X97161 


H.sapiens ThE3 gene, 
exon 4, 5 & 6


0.21


854065 


(X83413) U88 [Human 
herpesvirus 6] 


0.30 


511 


X67649 


R.norvegicus DNA 
sequence for 
LFB1/HNF1 
promoter 


0.21 


3913114 


TRANSCRIPTION FACTOR
COUP 2 COUP-TFII - chicken
>gi|392817 (U00697) orphan
receptor COUP-TFII [Gallus
gallus]


0.004 


512 


U63807 


Fugu rubripes growth 
hormone (GH) gene, 
complete cds 


0.21


3510505


(AF030881) pol polyprotein
[Fugu rubripes]


3e-04


513 


Z95636 


H.sapiens mRNA for 
laminin alpha 5 chain 


0.21 


400350 


NAM7 PROTEIN (NONSENSE-
MEDIATED MRNA DECAY
PROTEIN 1) (UP-
FRAMESHIFT SUPPRESSOR
1) factor NAM7 - yeast
(Saccharomyces cerevisiae)
>gi|4CP3


1e-07


514 


U91907


Mirounga leonina
major

histocompatibility
complex class II
(DQA) gene, partial
cds


0.20 


<NONE> 


<NONE> 


<NONE> 


515 


Z35758


Transmissible
gastroenteritis virus
PFI virion protein
genes


0.20 


<NONE> 


<NONE> 


<NONE> 


516 


X00334


Drosophila virilis
simple DNA
sequence (pDv-19)


0.20 


<NONE> 


<NONE> 


<NONE> 





517 


M76741 


Homo sapiens biliary 
glycoprotein (BGP) 
gene, partial cds. 


0.20


<NONE>


<NONE> 


<NONE> 


518 


D78515


Mus musculus rae28
gene, exon I and
5' flanking region


0.20


<NONE>


<NONE> 


<NONE> 


519 


M62975 


Drosophila 
melanogaster RNA 
polymerase II second 
largest subunit 
upstream (DmRP 
140) gene, exons 1-4. 


0.20 


<NONE>


<NONE> 


<NONE> 


520 


M27260 


Chicken 78-kD 
glucose-regulated 
protein, complete cds. 


0.20 


<NONE>


<NONE> 


<NONE> 


521 


AF076470 


Rice tungro 
bacilliform virus 
Serdang strain, 
complete genome 


0.20


<NONE> 


<NONE> 


<NONE> 


522 


AF076470 


Rice tungro 
bacilliform virus 
Serdang strain, 
complete genome 


0.20


<NONE> 






523 


U04636 


Human
cyclooxygenase-2
(hCox-2) gene,
complete cds.


0.20


<NONE> 


<NONE> 


<NONE> 


524 


AE001430


Plasmodium
falciparum
chromosome 2,
section 67 of 73 of
the complete
sequence


0.20


<NONE> 


<NONE> 


<NONE> 


525 


AF043514


Mus musculus
phosphomannomutase
(Pmm2) mRNA,
complete cds


0.20


3025006


HYPOTHETICAL 15.5 KD

PROTEIN IN MOAE-RHLE
INTERGENIC REGION
>gi|1787009 (AE000181) orf,
hypothetical protein
[Escherichia coli]


9.5






526 



U23144 



Xenopus laevis FTZ 
F1-related nuclear
orphan receptor
variant (xFF1rAshort)
mRNA, complete cds.



0.20 



3184402 



(AB014477) period protein
[Chymomyza costata]



PROBABLE MICROSOMAL
SIGNAL PEPTIDASE 23 KD
SUBUNIT (SPC22/23)
>gi|630688|pir||S44854
K12H4.4 protein -
Caenorhabditis elegans
>gi|289708 (L14331) homology
with signal peptidase; coded for
by C. elegans cDNAs GenBank:
M79661, M79662 and M79663;
putative



9.6



527 



U14621 



Paracentrotus lividus
Pax-6 (suPax-6) 
mRNA, complete cds. 



0.20 



465894 



7.7 



528 



AF030511



Actinobacillus
pleuropneumoniae
MRP ATPase
homolog (mrp) gene,
partial cds; ApxIVA
var3 (apxIVA) gene,
complete cds; and
beta-galactosidase
(lacZ) gene, partial
cds



0.20



1175966 



HYPOTHETICAL 45.3 KD 
PROTEIN IN THI5 5'REGION
>gi|1084720|pir||S56193
probable membrane protein
YFL062w - yeast
(Saccharomyces cerevisiae)



529 



AF070581 



Homo sapiens clone 
24540 mRNA 
sequence 



0.20 



542394 



glyoxal oxidase (EC 1.2.3.-)
precursor - basidiomycete
(Phanerochaete chrysosporium)
>gi|1050302



7.2 



5.8 



530 



X75437 



T.maritima pgK gene 
for 3-

phosphoglycerate 
kinase 



0.20 



825648 



(Z34531) coproporphyrinogen
oxidase [Homo sapiens] 



5.8 



531 



U32686 



Haemophilus . 
influenzae Rd section 
1 of 163 of the 
complete genome 



0.20



3309593



(AF072878) ciliary outer arm
dynein beta heavy chain



5.6



532 



Z28081 



S.cerevisiae
chromosome XI
reading frame ORF
YKL081w



0.20 



2507201 



CARBON CATABOLITE
DEREPRESSING PROTEIN 
KINASE >gi| 1469803 (L78129) 
serine/threonine kinase [Candida 
albicans] 



%1 



5.5 



Hordeum vulgare



limit dextrinase
(HvLD99) gene,
533 | AF022725 | complete cds



Drosophila
melanogaster cosmid
534 | AL021726 | 171E4



0.20 



0.20 



536 



H.sapiens HLTF gene
for helicase-like
Z46606 | transcription factor



537 



538 



Clostridium
perfringens C beta 2 
toxin gene, complete 
L77965 cds 



RIBOSOMAL PROTEIN
L30B (RP29) cytosolic - yeast
(Saccharomyces cerevisiae)
>gi|Z7l821 not determined)
[Saccharomyces cerevisiae]
>gi|1045254 cerevisiae]
>gi|1323250|gnl|PID|e243708
(Z72933) ORF YGR148c
[Saccharomyces cerevisiae]



0.20



DNA-REPAIR PROTEIN
XRCC1 



HymiHkllC'AL KJD 
PROTEIN IN SEC53-ACT1 
INTERGENIC REGION 
>gi|1084703|pir||S56211
probable membrane protein
YFL044c - yeast
(Saccharomyces cerevisiae)
>gi|836711|gnl|PID|d1009835
(D50617) YFL044C



Chicken neural cell
adhesion molecule (N-
539 | M15938 | CAM) gene, exon 18.



2133082 



regulatory protein MSR1 - yeast
HVpOTOETKaL 5x9 KB 



Solanum tuberosum 
mRNA for extensin-
540 | AJ003220 | like protein, partial



PROTEIN C56G2.1 IN
CHROMOSOME III
>gi|726413 (U23177) C56G2.1
gene product [Caenorhabditis



U 



EPIDERMAL GROWTH
FACTOR PRECURSOR
precursor - mouse >gi|309210
(J00380) prepro-egf [Mus
musculus]







AB011179


Homo sapiens mRNA
for KIAA0607
protein, partial cds


0.20


2143753


gene VGF protein - rat
>gi|205690 (M60525) nerve
growth factor inducible protein
[Rattus norvegicus] >gi|205701
(M60522) nerve growth factor-
inducible protein [Rattus

norvegicus] >gi|207651

RNA binding protein rnpD -
Arabidopsis thaliana (fragment)
>gi|510240 (X6U08) RNA
binding protein [Arabidopsis
thaliana]




3** J 


X75318


H.sapiens ITIH1 gene
(exon 22) and ITIH3
gene 


j 1 
0.20 1 


629557 


0.39
0.38 




AB008374 


Oncorhynchus mykiss
mRNA for alpha 3
type I collagen,
partial cds


0.20


1082610


muf1 protein - human
>gi|762953 (X86018) muf1
[Homo sapiens]


0.37 


545 


U09809 


Limulus polyphemus
arginine kinase
mRNA, complete cds.


0.20


J 8820 to 


(AJ012650) CP [Papaya 
ringspot virus] 


037 


546 


AB020671 


Homo sapiens mRNA 
for KIAA0864
protein, partial cds 


0.20 1 


2674350 


(U93121) M-phase
phosphoprotein-1 [Homo 
sapiens] 


0.1S 


547


L04457


Phytophthora
megasperma
mitochondrial
ORF152, complete
cds, cytochrome c
oxidase subunit I
(cox1) gene,
complete cds,
cytochrome c oxidase
subunit II


0.20


746516


(U23517) D1022.7
[Caenorhabditis elegans]
>gi|3258651 elegans]


0.043 


548


L04457


Phytophthora
megasperma
mitochondrial
ORF152, complete
cds, cytochrome c
oxidase subunit I
(cox1) gene,
complete cds,
cytochrome c oxidase
subunit II


0.20


746516


(U23517) D1022.7
[Caenorhabditis elegans]
>gi|3258651 elegans]


0.042



567 



571 



Tetrahymena
thermophila
telomerase
component p80
U25641 | mRNA, complete cds



0.19



Human mRNA for
KIAA0345 gene,
568 | AB002343 | complete cds



<NONE> 



<NONE> 



<NONE> 



0.19 



<NONE> 



<NONE> 



<NONE> 



569 | D10064



570 | U31734



Erwinia carotovora
gene for pectate lyase

III, complete cds
Homo sapiens clone
IMF118A4AI0
hypoxanthine
phosphoribosyltransferase
(hprt) 130 kb
deletion mutant
mRNA, partial cds,
contains human Alu
element



0.19



<NONE> 



<NONE> 



<NONE> 



Plasmodium
falciparum
chromosome 2,
section 23 of 73 of
the complete
AE001386 | sequence

Homo sapiens



049 



<NONE> 



<NONE> 



<NONE> 



0.19 



<NONE> 



<NONE> 



<NONE> 



572 | M95623



hydroxymethylbilane
synthase gene,
complete cds. 



0.19 



<NONE> 



<NONE> 



<NONE> 



S67478



(GC*IS)=vitamin D
binding protein/group
specific component
[human, peripheral
blood leukocytes,
Genomic, 794 nt,
Segment 4 of 9] 



0.19 



X99075 



H.sapiens NRGN
gene, exon 1



<NONE> 



<NONE> 



<NONE> 



AF044775 



Homo sapiens
breakpoint cluster
region BCRdcrI4

sequence



0.19 



<NONE> 



<NONE> 



<NONE> 



0.19



<NONE> 



<NONE> 



<NONE>






Human mRNA for










576 


AB002333 


KIAA0335 gene 
complete cds 


0.19 


<NONE> 


<NONE> 


<NONE> 


577 


U53566 


Macaca mulatta pit-
1/GHF-1

transcription factor
mRNA, complete cds


0.19 


1078068 


probable membrane protein 
YLR311c - yeast


9.2


578 


U73664 


Human 

t(11;14)(q13;q32)
breakpoint junction 
sequence 


0.19 


116734


COAT PROTEIN (CAPSID
PROTEIN) virus >gi|58901
(X62133) CyMV coat protein 
gene product 


8.8 


579 


AF004054 


Heterophyllaea
pustulata rps16 gene,
chloroplast gene, 
partial intron 
sequence 


0.19 


1928991 


(U92815) heat shock protein 70 
precursor [Citrullus lanatus]


8.7 


580 


Z27081


Caenorhabditis
elegans cosmid
M01A8, complete
sequence
[Caenorhabditis
elegans]


0.19 


2496247 


HYPOTHETICAL ATP-
BINDING PROTEIN MJ0625
>gi|2128413|pir||A64378
hypothetical protein MJ0625 -
Methanococcus jannaschii
>gi|1591336 (U67510) M.
jannaschii predicted coding
region MJ0625


8.6 


581 


Z74145 


S.cerevisiae 
chromosome IV 
reading frame ORF 
YDL097c


0.19


1174425 


TYROSINE-PROTEIN 
KINASE SPK-1 


6.7 


582 


< 

D38547


Small round
structured virus
genomic RNA,
3'terminal sequence
containing ORF2 and
ORF3


0.19


971318


(Z48053) putative protein
[Bovine herpesvirus 1]


5.1 





583 


D88000 


DNA for 16S ribosomal

RNA >

dbj|D88002|D88002
Ralstonia eutropha
DNA for 16S
ribosomal RNA >
dbj|D88003|D88003
Ralstonia eutropha
DNA for 16S
ribosomal RNA >

dbj|D88004|D88004

Ralstonia eutropha
DNA for 16S
ribosomal RNA


0.19


3800952 


(AF100657) No definition line 
found [Caenorhabditis elegans]


5.1 


584 


UO/*fO* 


Methanococcus 
jannaschii section 4 
of 150 of the 
complete genome 


0.19


3183617


(AJ005586) MYB-related
transcription factor
[Antirrhinum majus]


4.0


585 


L23906 


Gallus domesticus
microsatellite DNA 
marker. 


0.19 


1947094 


(U93074) voltage-gated sodium 
channel homolog BdNal 








Helicobacter pylori, 
strain J99 section 23 
of 132 of the 
complete genome 


0.19 


1730177 


GLUCOSE-6-PHOSPHATE 
ISOMERASE (GPI) 
ISOMERASE) (PHI) 
>gi|2118333|pir||l43073 glucose 
phosphate isomerase - Chinese 
hamster >gi|987046 griseus]


3.9 


587 


< 

1 

M19460


P.putida catBC
operon encoding
cis,cis-muconate
lactonizing enzyme I
and muconolactone
isomerase genes,
complete cds.


0.19


3873843


[Zozz^oj cDNA EST
yk251g7.3 comes from this
gene; cDNA EST yk251g7.5
comes from this gene; cDNA
EST EMBL:D68223 comes
from this gene; cDNA EST
EMBL:C12737 comes from this
gene; cDNA EST yk389c8.5
comes from this gene; cDNA


3.9 


588 


i 

U22349


Tetrahymena australis
telomerase RNA
gene, complete
sequence


0.19


4105782


(AF049922) PGPL69-i2
[Petunia x hybrida]


3.2 






597 | M69053



D.melanogaster
calcium-activated K+
channel subunit



598 | AF076279



Dictyostelium
firmibasis plasmid
Dfp1, complete
plasmid sequence



0.19 



1707984 



GLUTAMATE SYNTHASE
(FD-GOGAT)
>gi|2126524|pir||S60228
glutamate synthase (ferredoxin)
(EC 1.4.7.1) gltB -
Synechocystis sp. (PCC 6803)
>gi|515938 (X80485) glutamate
synthase



(U00008) yejA [Escherichia
0.19 | 453986 | coli]



0.80 



599 I D28873 



Mouse MCNP gene 
for C-type natriuretic 
peptide, complete cds 
(exon1, exon2)



0.19 



2650444 



(AE001092) acetyl-CoA
synthetase (acs-1)
[Archaeoglobus fulgidus]



0.63 



600 | U06071



Oxytricha nova
macronuclear actin I
gene, complete cds



601 | L54057



Homo sapiens CLP
mRNA, partial cds.



0.19 | 1584024 | complement control protein [Botryllus schlosseri]



0.19 | 3036883 | (AL022374) putative ABC transporter



0.48



602 | X89806



P.lividus cDNA for
COLL2alpha gene



603 | AE001104



Archaeoglobus
fulgidus section 3 of
172 of the complete
genome



0.19 



3638957 



(AC004877) sco-spondin-mucin-
like; similar to P98167 uncertain
[Homo sapiens] 



0.46 



0.41 



604 I U54501 



Rattus norvegicus
microsatellite
sequence D0Mco22



0.19 | 2315192 | (Y11739) transcription factor [Homo sapiens]



605 I X74468 



606 | U20285



Human 

papillomavirus type 
15 genomic DNA 



Human Gps1 (GPS1)
mRNA, complete cds



607 | D49408



Human gene for
interleukin 3 receptor
alpha subunit, exon
10



0.19 



228951 



ID-MeAsp 

receptor:ISOTYPE=epsilon3
[Mus musculus]



0.35



0.32 



0.19 



3695390 



(AF096371) contains similarity 
to Rattus norvegicus cyclin G- 
associated kinase (SW:P97874) 
[Arabidopsis thaliana]



0.19 



2582659 



(AJ002527) glucitol-6-
phosphate dehydrogenase
[Clostridium beijerinckii]



0.19



252236S 



(AF008596) alpha 1,3-
fucosyltransferase [Helicobacter
pylori]



0.28 



0.27 



0.16 



Nearest Neighbor (BlastN vs. Genbank)


Nearest Neighbor (BlastX vs. Non-Redundant Proteins)


SEQ
ID


ACCESSION


DESCRIPTION


P VALUE


ACCESSION


DESCRIPTION


P VALUE


608 


AF04U41 


Homo sapiens
pituitary specific
homeodomain protein
(PROP1) gene, exon
3 and complete cds


0.19


1 37403 


(X03541) trk gene product (aa 1-
641) [Homo sapiens]


0.091 


609 


L12531 


Discopyge ommata 
Ca2+ channel alpha 1 
subunit gene
sequence. 


0.19 


3618274




0.069 


610 


AF052445 


Yellow fever virus
clone HONG9 
polyprotein gene, 
complete cds 


0.19 


1932822


(1/15928) KH-domain putative 

PW A kirvlinn AW\ia<n 


0.001


611 


Z36946 


B.anthracis sap gene 
encoding S-layer 
protein 


0.19 


1 173241 


(L06487) ZIP1 protein 
[Saccharomyces cerevisiae]


2e-04 


612 


AF087984 


Homo sapiens full 
length insert cDNA 
clone YW29A12 


0.19 


3786014 


(AC005499) hypothetical
protein [Arabidopsis thaliana]


1e-06


613 


AE001010


Archaeoglobus 
fulgidus section 97 of 
172 of the complete 
genome 


0.19 


3135493 


(AF060248) unknown 
[Arabidopsis thaliana]


7e-08 


614 


L08965 


Trichosporon 
cutaneum carbamoyl 
phosphate synthetase 
large subunit (argA) 
gene, partial cds. 


0.19


1086901 


(U41278)F33G12.3 gene 
product [Caenorhabditis 
elegans] 


2e-08 


615 


M91466


Rattus norvegicus 
A2b-adenosine 
receptor mRNA, 
complete cds. 


0.19


2984320 


(AE000773) acetoin utilization
protein [Aquifex aeolicus]


6e-09 


616 


X95971


S.lividans groEL2
gene


0.19


3925277 


,Aluj264J) similar to 
Uncharacterized protein family
UPF0034, Double-stranded
RNA binding motif; cDNA EST
yk489b3.5 comes from this
gene; cDNA EST yk439g7.5
comes from this gene
[Caenorhabditis elegans]


7e-10 


617 


U12539


Schizosaccharomyces
pombe scd2 (scd2)
gene, complete cds.


0.19


1938549


(U97016) Similar to drosophila
Rlcl gene product ribosomal
protein L4 (YML4)
(NID:g459259)


3e-14



WO 01/02568 
SEQ



Nearest Neighbor (BlastN vs. Genbank) | Nearest Neighbor (BlastX vs. Non-Redundant Proteins)



I ACCESSION I DESCRIPTION 



618 



U12539

Schizosaccharomyces
pombe scd2 (scd2)
gene, complete cds



P VALUE | ACCESSION 



DESCRIPTION 
(U97016) similar to drosophila



0.19 



1938549 



Rlcl gene product ribosomal 
protein L4 (YML4) 
(NID:g459259)



P VALUE 



619 



Z68327

Human DNA
sequence from
cosmid U25D11,
between markers
DXS366 and DXSS7
on chromosome X



620 



U66525

Dictyostelium
discoideum
ORFveg114 mRNA,
complete cds



0.19 



3875774 



EMBL:D32434 comes from this
gene; cDNA EST
EMBL:D33710 comes from this
gene; cDNA EST
EMBL:D34467 comes from this
gene; cDNA EST
EMBL:D35005 comes from this
gene; cDNA EST
EMBL:D37535 comes from this
gene; ...
>gi|3878710|gnl|PID|e1348373
EST EMBL:D33710 comes
from this gene; cDNA EST
EMBL:D34467 comes from this
gene; cDNA EST
EMBL:D35005 comes from this
gene; cDNA EST
EMBL:D37535 comes from this
gene; ...



0.19 



3540281 



(AF056U6) AII-1 related 
protein [Fugu rubripes]



621 



JJ25S30

Newcastle disease
virus isolate Herts/33
matrix protein
mRNA, complete cds



622 



U89407

Mus musculus strain
BALB/c delta-
aminolevulinic acid
dehydratase (Lv)
mRNA, partial cds



0.19 



2228750 | (U9386S) RNA polymerase III subunit [Homo sapiens]



0.19 



623 



AF095598

Bison bison
athabascae
microsatellite BBJ 2



1825764 



0.18 



<NONE> 



624 | AF064260



Strongylocentrotus
purpuratus SRC8
mRNA, complete cds



0.18 



<NONE> 



(U88314) C46H11.1 gene
product [Caenorhabditis 
elegans] 



6e-15 



2e-17



1e-18



<NONE> 



<NONE> 



3e-25 



<NONE> 



<NONE> 






Nearest Neighbor (BlastN vs. Genbank)


Nearest Neighbor (BlastX vs. Non-Redundant Proteins)


SEQ
ID


ACCESSION


DESCRIPTION


P VALUE


ACCESSION


DESCRIPTION


P VALUE


625 


U69533 


Arabidopsis thaliana
AtKAP alpha mRNA,
complete cds


0.18


<NONE>


<NONE>


<NONE> 


626 


D89041 


Bovine DNA for
prostaglandin
F2alpha receptor,
partial cds


0.18


<NONE>


<NONE>


<NONE> 


627 


M24571 


Dictyostelium
discoideum tRNA-
Glu-GAA gene, clone
yGluGAA7,


0.18


<NONE>


<NONE>


<NONE>


628 


X59772


D.melanogaster ovo
gene required for
female germ line
development


0.18 


<NONE> 


<NONE>


<NONE> 


629 


AL010209 


Plasmodium 
falciparum DNA *** 
SEQUENCING IN 
PROGRESS *** 
from contig 3-104, 
complete sequence 


0.18 


<NONE> 


<NONE> 


<NONE> 


630 


U67575 


Methanococcus
jannaschii section 117
of 150 of the
complete genome


0.18


111839


inositol 1,4,5-triphosphate 
receptor 2 - rat 


8.5


631 


U28730 


Caenorhabditis
elegans cosmid
K10B2


0.18


1787604


(AE000232) orf, hypothetical
protein [Escherichia coli]


8.3


632 


X99798 


L.lactis pepF1 &
pepF2 genes


0.18


3406624


(AF079110) glycosomal malate
dehydrogenase [Trypanosoma
brucei]


8J 


633 


AF025306 


Danio rerio band 4.1-
like protein 4 (nbl4)
mRNA, complete cds


0.18


465445


PROBABLE NUCLEAR
ANTIGEN herpesvirus 1 (strain
Kaplan) >gi|334072 (M34651)
ORF-3 protein (Pseudorabies
virus)


7.9 


634 


AF059251


Mus musculus
lipoxygenase (alox)
mRNA, complete cds


0.18


1655667


(Z81368) hypothetical protein
Rv2393


6.6 


635 


222605


G.domesticus CTCF
protein mRNA.


0.18


481864


3-methyl-2-oxobutanoate
dehydrogenase


6.6 


636 


AB011086


Homo sapiens mRNA
for KIAA0514
protein, complete cds


0.18


3874158


(Z81464) predicted using
Genefinder


6.4 



Nearest Neighbor (BlastN vs. Genbank)


Nearest Neighbor (BlastX vs. Non-Redundant Proteins)


SEQ
ID


ACCESSION


DESCRIPTION


P VALUE


ACCESSION


DESCRIPTION


P VALUE


637


Z78536


Caenorhabditis
elegans cosmid
C07A4, complete
sequence
[Caenorhabditis
elegans]


0.18 


3702121


(AJ011681) retinoblastoma-
related protein [Chenopodium 
rubrum] 


6.4 


638 


U67530


Methanococcus 
jannaschii section 72 
of 150 of the 
complete genome 


0.18


3877946


(ZK11W4) Weak similarity to 65
KDA heat shock protein
(TR:G602231); cDNA EST
EMBL:D71705 comes from this
gene; cDNA EST
EMBL:D74382 comes from this
gene [Caenorhabditis elegans]


6.3 


639 




Influenza 

A/Duck/England/1/62 
(H4N6) nucleoprotein 
mRNA, complete cds. 


0.18 


3873663 


frb96J4)cUNAlAl — 6 

EMBL:D71510 comes from this 
gene; cDNA EST 
EMBL:C08449 comes from this 
gene; cDNA EST yk266b12.3
comes from this gene; cDNA
EST yk266b12.5 comes from
this gene; cDNA EST
yk461h7.3 comes from this
gene; cDNA...


6.2 


640


M73781


Oryctolagus
cuniculus integrin
beta-8 subunit
mRNA, complete cds.
> gb|I44828|I44828
Sequence 3 from 
patent US 5635601 


0.18


1362129


major allergen OLE 17-
common olive


5.8 


641


X67219


D.melanogaster Rop
gene ...


0.18


3449286


(AB011527) MEGF1 [Rattus
norvegicus]


4.8


642


AF106941


Homo sapiens beta-
arrestin 2 mRNA,
complete cds


0.18


548353


[PROTEIN-PII]
URIDYLYLTRANSFERASE
vinelandii >gi|39257 (X59610)
uridylyl transferase


3.7 


643


AF052602


Danio rerio
huntingtin (HD)
mRNA, complete cds


0.18


241058


potential IGF binding protein
[chickens, Peptide Partial, 77 aa,
segment 2 of 3]


3.6 
SEQ



Nearest Neighbor (BlastN vs. Genbank)



ID | ACCESSION | DESCRIPTION



Nearest Neighbor (BlastX vs. Non-Redundant Proteins)



P VALUE | ACCESSION



DESCRIPTION
CRT predicted using"" 



644 | AB020709



Homo sapiens mRNA
for KIAA0902
protein, complete cds



645 | AF096883



HIV-1 isolate patient
|3 country USA pol
polyprotein (pol)
gene, partial cds



0.18 



3875570 



Genefinder; cDNA EST
EMBL:M75775 comes from this
gene; cDNA EST
EMBL:M89255 comes from this
gene; cDNA EST
EMBL:M89127 comes from this
gene; cDNA EST
EMBL:T00141 comes from this
gene; cDNA EST EMBL:T...



646 | L39928



Pyrocoelia miyako
(clone pB-PmL41)
luciferase mRNA,
complete cds 



0.18 



3250696 | (AL024436) putative protein



0.18 



(AC003974) unknown protein 
2914702 | [Arabidopsis thaliana]



647 | M17082



Human
carcinoembryonic
nonspecific
crossreacting antigen
(CEA;NCA) gene
exons 1 and 2.



0.18 



648 | X75318



H.sapiens ITIH1 gene
(exon 22) and ITIH3
gene



0.18 



REGULATORY PROTEIN 

1351833 AB AA 

RNA-binding protein rnpD -
Arabidopsis thaliana (fragment)
>gi|510240 (X61108) RNA
binding protein [Arabidopsis

629557 | thaliana]



649 | AF011908



Mus musculus
apoptosis associated
tyrosine kinase
(AATYK) mRNA,
complete cds



0.18 



330442 



(K03332) nuclear antigen 2 
[Epstein-Barr virus] 
ASPARTYL-TRNA



650 | U04004



Simian
immunodeficiency
virus SIVagmVER-2
envelope protein
gene, partial cds.
Xenopus laevis



0.18 



135102 



SYNTHETASE aspartate-
tRNA ligase (EC 6.1.1.12) -
Escherichia coli coli]
>gi|1736513|gnl|PID|d1016401
(D90829) Aspartate-tRNA
ligase (EC 6.1.1.12)
[Escherichia coli]



RanGTPase



P VALUE



2.1 



1.7 



0.73 



0.72



0.41 



5e-04 



6e-11
Nearest Neighbor (BlastN vs. Genbank)


Nearest Neighbor (BlastX vs. Non-Redundant Proteins)


SEQ
ID


ACCESSION


DESCRIPTION


P VALUE


ACCESSION


DESCRIPTION


P VALUE


652 


Z18921 


B.oleracea gene for S 
receptor kinase-like 
protein 


0.18 


3875535 


(zxftM i ) similar to nbolanasc; 
cDNA EST EMBL:D69553
comes from this gene; cDNA
EST EMBL:D65938 comes
from this gene; cDNA EST
yk280h9.3 comes from this
gene; cDNA EST yk280h9.5
comes from this gene; cDNA
EST yk223d11.3 come...


1e-19


653 


M60650 


S.cerevisiae STA2
gene, complete cds. 


0.16 


<NONE> 


<NONE> 


<NONE> 


654 


U80912 


Eucalyptus globulus 
NADP-isocitrate 
dehydrogenase 
(EglCDH) mRNA, 
complete cds 


0.16 


3766172 


(AF057298) ornithine 
decarboxylase antizyme 2 [Mus 
musculus] 


4.2 


655 


AF012899 


Sambucus nigra 
ribosome inactivating 
protein precursor 
mRNA, complete cds 


0.16 


76749 


hypothetical protein 4 - fowl 
iiucno.virus i 


4.0


656 


AF027174 


Arabidopsis thaliana 
cellulose synthase 
catalytic subunit (Ath-
B) mRNA, complete
cds 


0.16 


3044086 


(AF055904) unknown 


Q.oQ 


657 


AF030231 


Glycine max sucrose 
synthase (SS) mRNA, 
complete cds 


0.078 


<NONE> 


<NONE> 


<NONE> 


658 


M19183 


Woodchuck hepatitis 
virus (WHV),
complete genome,
clone WHV 59.


0.072


1076190


cell wall glycoprotein, 75K, 
precursor - diatom
(Cylindrotheca fusiformis)
>gi|515363 (X80394) P75K
gene product [Cylindrotheca 
fusiformis] 


6.3 


659 


I 

I 
i 
F 

U31557 c 


J vine adenovirus 
IVa2 protein gene,
DNA polymerase
gene, terminal protein
gene and 52,55 kDa
protein gene, partial
cds


0.072 


3511143


(AF061244) unknown
[Agrocybe aegerita]


6.2 



Nearest Neighbor (BlastN vs. Genbank)


Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 


SEQ 

ID


ACCESSION 


DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 






Caenorhabditis 










660 


AL021491 


elegans cosmid
Y44A6B, complete
sequence 
[Caenorhabditis 
elegans] 


0.070 


<NONE> 


<NONE> 


<NONE> 


661 


M33874 


X.laevis Xotch
complete cds. 


0.070 


1654096 


(Y09076) RAD3 
[Schizosaccharomyces pombe]


0.23 


662 


AB012725


Mus musculus
ZAN75 mRNA for

zinc finger protein

complete cds


0.069


1350800 


MITOCHONDRIAL 
RIBOSOMAL PROTEIN S5


2.0 


663 


AL021491


Caenorhabditis
elegans cosmid
Y44A6B, complete
sequence
[Caenorhabditis
elegans]


0.068 


<NONE> 


<NONE> 


<NONE> 


664 


Z60318 


H.sapiens CpG DNA, 
clone lei, reverse 
read cpglel.rla . 


0.068


1280134 


(U55376)F16H1L2 gene 
product [Caenorhabditis 
elegans]


2.6 


665 


Z35973 


S.cerevisiae
chromosome II
reading frame ORF
YBR104w


0.068 


2493000 


PROBABLE SUCCINYL-
COA:3-KETOACID-
COENZYME A

TRANSFERASE PRECURSOR
EMBL:Z14816 comes from this
gene; cDNA EST
EMBL:Z14946 comes from this
gene; cDNA EST
EMBL:D69746 comes from this
gene; cDNA EST yk219b6.3
comes from this gene; cDNA
ES...


0.63 


666 


Z861U 


Streptomyces lividans 
rpsP, trmD, rplS,
sipW, sipX, sipY, 
sipZ, mutT genes and 
4 open reading 
frames 


0.068


1235974 


(X96713) collagen [Globodera 
pallida]


4e-04 


667 


M729S0 


Anthonomus grandis
vitellogenin gene
(VTG), complete cds.


0.068 


3242750 


(AC005164) match to ESTs 
AA73U49 (NID:g2140138) t 
AA73I90S (NID:g27527l9), 
AA287837 (NID:gl933519), 
AA262811 (NID:g 1 898382), 
and AA825S20(NID:g2S99l32) 


1e-59



Nearest Neighbor (BlastN vs. Genbank)


Nearest Neighbor (BlastX vs. Non-Redundant Proteins)


SEQ
ID


ACCESSION


DESCRIPTION


P VALUE


ACCESSION


DESCRIPTION


P VALUE


668 


M3416I 


Rat tachykinin (PPT) 
gene, exons 5 and 6, 


0.067 


<NONE> 


<NONE> 




669 


L03811


Aspergillus niger zinc
finger protein (creA) 
gene, complete cds. 


0.067 


<NONE> 


<NONE> 




670 


M64983 


Human fibrinogen 
beta chain gene, 
complete mRNA. > 
gb|I47706|I47706 
Sequence 3 from 
patent US 5639940 


0.067 


<NONE> 


<NONE>


<NONE> 


671 


AF014051 


Nicotiana tabacum 
Mg chelatase subunit 
(ChlH) mRNA, 
partial cds 


0.067 


<NONE> 


<NONE> 


<NONE> 


672 


Y07540 


H.sapiens sil gene 


0.067 


92331 


glycoprotein GP330, renal - rat 
(fragments) 




673 


AJ000347 


Rattus norvegicus 
mRNA for 3*(2')3- 
bisphosphate 
nucleotidase 


0.067 


129238 


25 KD OOKINETE SURFACE

ANTIGEN PRECURSOR
(PRS25) >gi|320962|pir||A44966
25k ookinete surface antigen
precursor - Plasmodium
reichenowi reichenowi]


7.4


674 


L19979 


Squid sodium channel 
mRNA, complete cds. 


0.067 


2128473 


hypothetical protein MJ0750 - 
Methanococcus jannaschii 
>gi|1592304 (U67521)
Ferredoxin-type protein 


L5 


675 


X08050 


Yeast tRNA Glu(3)
gene and flanking 

regions 


0.067 


1334398 


(X15081) MURF2 protein (AA
1-348) 


0.65 


676 


X17115


Human mRNA for
IgM heavy chain
complete sequence


0.067 


1731331


HYPOTHETICAL 51.6 KD
PROTEIN CY49.14C
>gi|1370241|gnl|PID|e247089
(Z73966) hypothetical protein
Rv2075c [Mycobacterium
tuberculosis]


0.51


677 


AF032871


Homo sapiens
uncoupling protein 3
(UCP3) gene, exon 1
and partial exon 2


0.067 


112900


ALPHA-2C-1 ADRENERGIC
RECEPTOR human >gi|178194
(J03853) kidney alpha-2-
adrenergic receptor [Homo
sapiens] >gi|1628638 (U72648)
alpha2-C4-adrenergic receptor
[Homo sapiens]


0.50 
Nearest Neighbor (BlastN vs. Genbank)


Nearest Neighbor (BlastX vs. Non-Redundant Proteins)


SEQ 
ID 


ACCESSION 


DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 












oVWamIt^ $ (DYNAMIN, 




678 


X05319 


Mouse class II MHC
E-beta 2 (d) gene 
exon 3 


0.067 


585074 


TESTICULAR) rat 
>gi|39 1 872|gnl|r LP|d lUUJooS 
(D14076) testicular dynamin
[Rattus norvegicus]


3e-04 


679 


AB006362 


Candida albicans 
CaSLNl gene, 
complete cds 


0.067


3417296 


( ACOujOfJ/) Unknown gene 
product (partial) [Homo sapiens]


9e-56 


680 


AF021236


African horse 
sickness virus capsid 
VP3 (L3) mRNA, 
complete cds 


0.066


<NONE> 


<NONE> 


<NONE> 


681 


AE001507


Helicobacter pylori, 
strain J99 section 68 
of 132 of the 
complete genome 


0.066 


<NONE> 


<NONE> 


<NONE> 


682 


AF039717


Caenorhabditis 
elegans cosmid 
R13H8 


0.066 


<NONE> 


<NONE> 


<NONE> 


683 


AF029027 


Syncerus caffer
isolate Queen 
Elizabeth Mweya 14 
mitochondrial DNA 
control region 


0.066


<NONE> 


<NONE> 


<NONE> 


684 


AF087967 


Homo sapiens full 
length insert cDNA 
clone YU51G05 


0.066 


2982476 


(X97203) CI protein [Beet curly 
top virus] 


9.5 


685 


J02037 


Baboon endogenous 
virus pro viral long 
terminal repeat DNA. 


0,066 


972767 


(L37868) POU-domain 
transcription factor [Homo 
sapiens]


7.3


686 


AF000141 


Lycopersicon 
esculentum class I 

knotted-like

homeodomain protein 
(LeT6) mRNA, 
complete cds 


0.066 


3157926 


(AC002131) Strong similarity to 
extensin-like protein gb|Z34465 
from Zea mays. [Arabidopsis 
thaliana] 


5.6 


687 


AB001746


Bensingtonia sp.
OK255 gene for 18S
rRNA > ::

dbj|AB001747|AB00
1747 Bensingtonia
sp. OK259 gene for
18S rRNA


0.066 


3859889 


(AF070064) cap 'n' collar
isoform C [Drosophila
melanogaster]


0.38



Nearest Neighbor (BlastN vs. Genbank)


Nearest Neighbor (BlastX vs. Non-Redundant Proteins)


SEQ
ID


ACCESSION


DESCRIPTION


P VALUE


ACCESSION


DESCRIPTION


P VALUE






Helicobacter pylori,












AE001461 


strain J99 section 22 
of 132 of the 
complete genome 


0.065 


<NONE> 


<NONE> 


<NONE> 




M30821 


Chicken erythroid 
transport proteins c1
and c2 


0.065 


<NONE> 


<NONE> 


<NONE> 


690 


AB009802 


Homo sapiens gene 
for osteonidogen, 
intron 3 


0.065


<NONE> 


<NONE> 


<NONE> 


691 


AF086062 


Homo sapiens full 
length insert cDNA 
clone YZ06B11


0.065 


<NONE> 


<NONE> 


<NONE> 


692 


AB002369 


Human mRNA for 
KIAA0371 gene,
complete cds


0.065


2500884 


SIGNAL SEQUENCE 
BINDING PROTEIN binding 
protein [Synechococcus sp.] 


5.5 




AF086864 


Cyclopodia sp. large 
subunit ribosomal 
RNA gene, 
mitochondrial gene 

for mitochondrial

RNAs, partial
sequence >::
gb|AF086866|AF086
866 Penicillidia sp. 
large subunit 
ribosomal RNA gene, 
mitochondrial gene 
for mitochondrial 
RNAs, partial 
sequence 


0.065 


3721684 


(AB012957) probable glycosyl
transferase [Vibrio cholerae]


5.5


694 


L44593 


Bacteriophage BK5-T 
ORF410, 3' end of
cds, 20 ORFs. 
repressor protein, and 
Cro repressor protein 
genes, complete cds, 
ORF70* gene, 5' end 
of cds. 


0.065 


1172067 


PEPTIDASE T 
(AMINOTRIPEPTIDASE) 
influenzae Rd] 


3.2


695 


U80079 


Ciona intestinalis 
MyoP-family protein 
(CiMDFa) mRNA, 
complete cds 


0.065 


4218110 


(AL035353) contains EST 
gb:F15281 


2.5 
SEQ
ID 



696 



Nearest Neighbor (BlastN vs. Genbank)



ACCESSION | DESCRIPTION



AB020718

Homo sapiens mRNA
for KIAA0911
protein, complete cds



P VALUE 



Nearest Neighbor (BlastX vs. Non-Redundant Proteins)



ACCESSION 



DESCRIPTION 




0.065 



MINOR CAPSID PROTEIN L2 
1722734 >giji020192 type 23} 



697 



Zea mays copia like
retrotransposon Stl 
1 14 leader region. 
AF082137 [partial sequence 



0.065 



1877501 | (U89278) polyhomeotic 2 homolog [Homo sapiens]



698 



699 



X64053

R.norvegicus ZnBP
gene for zinc binding
protein



0.065 



464963 | TRYPSIN PRECURSOR



U67065

Mus musculus
butyrophilin (BTN)
gene, promoter region
and complete cds



0.065



hypothetical protein YPL263c 
2132252 [yeast 



700 



701 



702 



703 



M64862

Rat matrin F/G
mRNA, complete cds
Yeast (S.cerevisiae) -



0.065 



3420183 



(AF041105) organic anion
transporter protein 3 [Rattus
norvegicus]



K02205 



transcriptional
activator of amino
acid-biosynthetic

genes (GCN4) gene,
complete cds.



0.064 



X58282 



Maize mRNA for a
high mobility group
protein



<NONE> 



<NONE> 



0.064 



AC001545 



Homo sapiens
(subclone IJ3 from 
PI H69) DNA 
sequence 



<NONE> 



<NONE> 



0.064 



<NONE> 



<NONE> 



704 



AF023461 



Homo sapiens
FRA3B region 
sequence 



0.064 



<NONE> 



<NONE> 



705 



706 



707 



U50307 



Caenorhabditis 
elegans cosmid 
F43H9. 



U46542 



Streptococcus crista
HmpA gene, partial 
cds. putative 
adhesin/ABC 
transport system 
protein (scbA) gene, 
complete cds 



0.064 



<NONE> 



<NONE> 



X57564

A.rusticana mRNA
for neutral peroxidase



0.064 



1209391 



(D83659) TPR protein pombe] 
>gi|2894282|gnl|PID|e1251103
(AL021838) pre-mrna splicing
factor, [Schizosaccharomyces 
pombe] 



0.064 



1492037 



(U60315) MC094R [Molluscum 
contagiosum virus subtype 1]






Nearest Neighbor (BlastN vs. Genbank)


Nearest Neighbor (BlastX vs. Non-Redundant Proteins)


SEQ
ID


ACCESSION


DESCRIPTION


P VALUE


ACCESSION


DESCRIPTION


P VALUE


708


U06986


Human alpha-2-
macroglobulin
receptor/lipoprotein
receptor protein
(A2MR/LRP) gene,
exons 39-41.

0.064 


100800 


rab15B protein - wheat
>gi|21853 (X62476) rab protein
[Triticum aestivum]


5.3


709 


D85773 


Human CpG island 
sequence, clone 
Q28B8 


0.064 


2245382 


(U88325) suppressor of 
cytokine signalling-1 [Mus
musculus] 


5.3 


710 


L06178 


Apis mellifera 
ligustica complete 
mitochondrial 
genome 


0.064 


3695379 


(akwoj jO) contains similarity 
to a C. elegans hypothetical
protein F44G4.1 (GB:Z49910)
and several yeast hypothetical
proteins such as 35.1 KD
protein in NAM8-GAR1
intergenic region (SP:P38805)
[Arabidopsis thaliana]


3.2 


711 


Y16242


Triticum aestivum 
mRNA for beta- 
amylase 


0.064 


1175958 


HYPUlfenCAi yo.^KD — 
PROTEIN IN AGP3-DAK3 
INTERGENIC REGION 
>gi|1084712|pir||S56201
probable membrane protein
YFL054c - yeast
(Saccharomyces cerevisiae)
>gi|836701|gnl|PID|d1009825
(D50617) YFL054C 


3.1 


712 


L81779 


Homo sapiens 
(subclone 2_a2 from 
PI H25) DNA 
sequence 


0.064 


3845169 


(AE001391) phosphatase (acid 
phosphatase family) 


0.81 


713 


X13826


C.reinhardtii psb1
mRNA for OEE1
protein of
photosystem II
(oxygen-evolving
enhancer protein)


0.064 


171040 


(M94535) ATPase
[Saccharomyces cerevisiae]
cerevisiae, Peptide, 377 aa]
[Saccharomyces cerevisiae]


0.054


714 


X06487


H.sapiens mRNA for
bcl2-Ig fusion gene


0.064


2429362


(AF020261) proline rich protein
[Santalum album]


0.016 


715 


U79638


Mus musculus cyclin-
dependent kinase
inhibitor protein
(p15(INK4b)) gene,
exon 2 and partial cds


0.064


3929221


(AF032557) TRF1-interacting
ankyrin-related ADP-ribose
polymerase [Homo sapiens]


le*lQ 






Nearest Neighbor (BlastN vs. Genbank)


Nearest Neighbor (BlastX vs. Non-Redundant Proteins)


SEQ
ID


ACCESSION


DESCRIPTION


P VALUE


ACCESSION


DESCRIPTION


P VALUE


716


U39099


Human T cell
receptor alpha chain
mRNA, partial cds


0.063


<NONE>






717 


U39673 


Clostridium 
acetobutylicum KdpC 
(kdpC) gene, partial
cds. sensor histidine 
kinase homolog 
(kdpD) and response 
regulator homolog 
(kdpE) genes, 
complete cds 


0.063 


<NONE>


<NONE> 


<NONE> 


718 


AL022317 


Human DNA
sequence from clone 
140Llon 

chromosome 22q13.1-
13.31, complete
sequence [Homo 
sapiens] 


0.063 


1931640 


(U95973) Serine 
carboxypeptidase isolog 
[Arabidopsis thaliana]


5.2 


719 


U28972 


Spiroplasma citri orfa
and orff genes, partial
cds, orfb, orfc, and
orfe genes and
Spiroplasma virus
SpV1-derived ORF1
and ORF3 genes,
complete cds, and
SpV1-derived ORF14
gene, partial cds.


0.063


4091939 


(AF070704) envelope
glycoprotein [Human
immunodeficiency virus type 1]


5.2 


720 


U15159 


Mus musculus limk
kinase (limk) mRNA,
complete cds


0.063


3638957


(AC004877) sco-spondin-mucin-
like; similar to P98167 uncertain
[Homo sapiens]


5.1 


721 


AF058416


Homo sapiens
lipoprotein receptor-
related protein
(LRP1), exons 39, 40,
and 41


0.063


1788123


(AE000276) orf, hypothetical
protein [Escherichia coli]


4.0 


722 


AE001430


Plasmodium
falciparum
chromosome 2,
section 67 of 73 of
the complete
sequence


0.063


2244S49


(Z97337) hypothetical protein


4.0 








Nearest Neighbor (BlastN vs. Genbank)


Nearest Neighbor (BlastX vs. Non-Redundant Proteins)


SEQ
ID


ACCESSION


DESCRIPTION


P VALUE


ACCESSION


DESCRIPTION


P VALUE


723


L29323


Streptococcus
pneumoniae methyl
transferase gene
cluster, complete
sequence


0.063 


387402? 


(Z70203) cDNA EST
EMBL:D72339 comes from this
gene; cDNA EST
EMBL:D75197 comes from this
gene [Caenorhabditis elegans]


2.3 


724 


X72631 


H.sapiens mRNA
encoding Rev-
ErbAalpha > ::
emb|X72632|HSREV
ERB2 H.sapiens
mRNA encoding Rev-
ErbAalpha (internal 
fragment) 


0.063 


3979878 


" Ci/JlU^J predicted usinf 

Genefinder; cDNA EST
EMBL:T01277 comes from this
gene; cDNA EST
EMBL:T01796 comes from this
gene; cDNA EST
EMBL:D32545 comes from this 
gene; cDNA EST 
EMBL:D33060 comes from this 
gene; cDNA EST EMBL:D... 


1.7 


725 


U17969 


Human initiation 
factor eIF-5A gene,
complete cds. 


0.063 


2429509 


(AF025467) contains similarity 
to drosophila DNA-binding 
protein K10 (NID:g814S) 
[Caenorhabditis elegans] 


L4 


726 


AE001000 


Archaeoglobus 
fulgidus section 107 
of 172 of the 
complete genome 


0.063


3462802 


(AF082486) nef protein [Human 
immunodeficiency virus type 1]


O i5 


727 


S80986 


svp[40]=svp-related 
nuclear 

receptor/retinoid 
signaling modulator 
zebrafishes. mRNA, 
3876 nt] 


0.063


1326288 


(U58734) weak similarity to 
ankyrin G [Caenorhabditis 
elegans]


0.093 


728 


AF109134 


Homo sapiens 7-60 
mRNA, complete cds


0.063


1083764 


proline-rich proteoglycan 2 
precursor, parotid - rat 
>gi|310200 (L17318) proline-
rich proteoglycan [Rattus 
norvegicus] 


0.001 


729 


D87466 


Human mRNA for 
KIAA0276 gene,
partial cds 


0.063 


2879865 


(AL021816) SPBC24E9.03c,
unknown, len:251aa
[Schizosaccharomyces pombe]


6e-05 


730 


AB018269


Homo sapiens mRNA
for KIAA0726
protein, complete cds


0.063


2995865


(AF053455) tetraspan TM4SF
[Homo sapiens]


2e-16


731 


D86954


Cricetulus griseus
mRNA for
cytochrome P-450
>A14, complete cds 


0.063 


2496896


HYPOTHETICAL 41.6 KD
PROTEIN C16C10.5 IN

CHROMOSOME III

>gi|3S743S3|gnl|PID|e1344077
type (RING finger)
[Caenorhabditis elegans]


1e-22






SEQ

ID | ACCESSION



Nearest Neighbor (BlastN vs. Genbank)

DESCRIPTION | P VALUE



Nearest Neighbor (BlastX vs. Non-Redundant Proteins)



732 



AL010232

Plasmodium
falciparum DNA ***
SEQUENCING IN
PROGRESS ***
from contig 4-58,
complete sequence



ACCESSION 



733 | U90714



Mycoplasma
gallisepticum
haemagglutinin
precursor genes,
complete cds



0.062 



<NONE> 



734 | AF107044



Homo sapiens clone
pCL4 DNA-binding
protein SOX21
(SOX21) gene,
complete cds



0.062



<NONE> 



735 



L41729 



Caenorhabditis
elegans Ro
ribonucleoprotein
autoantigen mRNA,
complete cds



0.062 



<NONE> 



0.062 



2983060 



736



737 



738 



Z99287

Caenorhabditis
elegans cosmid
Y7A9D, complete
sequence
[Caenorhabditis
elegans]



DESCRIPTION 



<NONE> 



<NONE> 



<NONE>



<NONE> 



<NONE>



(AE000687) putative protein

[Aquifex aeolicus]
FUlAllvk 



0.062 



1176542 



AB014514

Homo sapiens mRNA
for KIAA0614
protein, partial cds



Human germline



0.062 



4033395 



DNA GYRASE SUBUNIT B
subunit [Myxococcus xanthus]



L29165 



immunoglobulin light
chain variable region
(lambda-IIIb
subgroup) from IgM
rheumatoid factor.



Schistosoma
japonicum Chinese
clone pY6



0.062



1914685 



PUTATIVE
SERINE/THREONINE-
PROTEIN KINASE D1044.3
IN CHROMOSOME III
>gi|495684 (U00065) contains
EGF-like repeats; highly similar
to ZC84.1; 3' exons similar to
protein kinase [Caenorhabditis
elegans]



8.6 



5.8 



3.9 



(Y12014) RAD23 protein, 
isoform II 



1.3 








Nearest Neighbor (BlastN vs. Genbank)


Nearest Neighbor (BlastX vs. Non-Redundant Proteins)


SEQ 
ID 


ACCESSION 


DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 






Leishmania pifanoi 






TROPOMYOSIN I (TMI) 




741 


M97695 


cysteine proteinase 
^cy&- ) gene, cumpicie 

cds. 


0.062 


1174754 


(POLYPEPTIDE 49) 
>gi|32G989|pir||A60607 
tropomyosin - fluke 


0.018 


742 


U67526


Methanococcus

jannaschii section 68

of 150 of the
complete genome


0.062


1330345 


1058;J5) coded lor by C — 
elegans cDNA yk34b1.5; coded
for by C. elegans cDNA
yk13h10.5; coded for by C.
ciegans cunA yK4oed.jj coded 
for by C. elegans cDNA 
yk46d5.5; coded for by C.
elegans cDNA yk43c2.5; coded
for by C. elegans cDNA
yk46e8...


1e-40


743 


Z78414 


Caenorhabditis
elegans cosmid 
W09D12, complete 
sequence 
[Caenorhabditis 
elegans] 


0.061 


<NONE> 


<NONE> 


<NONE> 


744 


Y13606 


Mus musculus gene 
encoding filensin, 
exons 6, 7


0.061 


2314715 


(AE000651) H. pylori predicted 
coding region HP 1527 


4.9 


745 


J04374 


Eggplant mosaic 
virus genome.


0.061 


141449 


HYPOTHETICAL 35.1 KD
PROTEIN IN TRANSPOSON
TN4556 >gi|80759|pir||JQ0431
hypothetical 35.1K protein -
Streptomyces fradiae transposon 
Tn4556 


3.8 


746 


AB022200 


Marine obligately
oligotrophic
bacterium POO-10

DNA for 16S
ribosomal RNA,
partial sequence


0.061 


3983593 


(AB000307) transcarboxylase- 

?eta 


2.2 


747 


X54250 


Rat mRNA for zinc 
finger protein AT- 
BP2, partial cds 


0.061 


1377886 


(L46815) DNA binding protein 
Rc [Mus musculus] 


0.9S 


748 


X69942


M.musculus mRNA
of enhancer-trap-
locus 1


0.061


2983969


(AE00074S) putative protein
[Aquifex aeolicus]


0.57


749 


AJ223206 


Mus musculus mRNA
for scrapie responsive
protein 1


0.061


4204265


(AC005223) 45643
[Arabidopsis thaliana]


5e-31 


750 


Y10205


H.sapiens mRNA for 
~DSS protein 


0.060 


<NONE> 


<NONE> 


<NONE> 






Nearest Neighbor (BlastN vs. Genbank)


Nearest Neighbor (BlastX vs. Non-Redundant Proteins)


SEQ
ID


ACCESSION


DESCRIPTION


P VALUE


ACCESSION


DESCRIPTION


P VALUE


762


AB000096


Mouse mRNA for
GATA-2 protein,
complete cds


0.023


<NONE>


<NONE>


<NONE>


763 


Z62366 


H.sapiens CpG DNA
clone 67h7, forward
read cpg67h7.ft1a.


0.023


3123312


ZINC FINGER PROTEIN 142
(KIAA0236) to Human zinc
finger protein (ZNF142) [Homo
sapiens]


5.9


764 


LU670 


Human
transmembrane
glycoprotein (ljjjj)
gene, exons 2 through
8.


0.023


80636


hypothetical 67K protein -
Mycobacterium fortuitum
plasmid pAL5000 >gi|149986
(M60875) ORF2


3.4


765


Sulculus diversicolor
DNA for IDO-like
myoglobin, complete
cds


0.023 


3114665 


(AF061267) inner membrane 
component HUE [Pseudomonas 
stutzeri] 


3.4


766


X98890


S.tuberosum mRNA
for inorganic
phosphate
transporter, StPT1


0.023


683532


(X02155) thyroglobulin [Bos 
taurus] 


it 1 


767


U58835


Dissostichus mawsoni
preprotrypsin gene,
complete cds


0.022


<NONE>


<NONE>


<NONE>


768


AJ009630 


Glomus versiforme 
chitin synthase gene 
(clone Gvchs3) 


0.022


<NONE>


<NONE>


<NONE>


769


J04040


Human glucagon
mRNA, complete cds.


0.022


<NONE>


<NONE>


<NONE>


770


X74908


L.esculentum Asr3
gene


0.022


<NONE>


<NONE>


<NONE>


771


L07293


Shigella dysenteriae
O-antigen
polysaccharide
biosynthesis rfbX, O-
antigen polymerase
(rfc), rhamnosyl
transferase I and II
(rfbR and rfbQ) and
rfbD genes, complete
cds.


0.022


<NONE>


<NONE>


<NONE>






Nearest Neighbor (BlastN vs. Genbank)



Nearest Neighbor (BlastX vs. Non-Redundant Proteins)



SEQ
ID



ACCESSION



DESCRIPTION



P VALUE



ACCESSION



DESCRIPTION



P VALUE



772



AF040094



Mus musculus
inositol
polyphosphate 5-
phosphatase II
(INPP5P) mRNA,
complete cds



0.022



<NONE>



<NONE>



<NONE>



773



X76776



H.sapiens HLA-DMB
gene



0.022



<NONE>



<NONE>



<NONE>



774



AE001521



Helicobacter pylori,
strain J99 section 82
of 132 of the
complete genome



0.022



<NONE>



<NONE>



<NONE>



775



X16004



A.longa rbcL, rpl5,
rps8, rpl36, rps14,
rps2, trnI, trnF, trnC
and rpoB (partial)
genes >
embpC7565 1|ALRIBF
A.longa plastid genes
for ribosomal
proteins, tRNAs,
RNA polymerase
subunit beta and
rubisco large subunit



0.022



<NONE>



<NONE>



<NONE>



776



Y12707



Lactococcus lactis
cremoris plasmid
pHW393 DNA,
rlladii, mlladii genes



0.022



<NONE>



<NONE>



<NONE>



777 



778 



779 



780 



U27118



Arabidopsis thaliana
glutamyl-tRNA
reductase



0.022



<NONE>



Z96622



H.sapiens telomeric
DNA sequence, clone
5PTEL002, read
5PTEL00002.seq



<NONE> 



0.022 



191333 



(J05503) carbamoyl-phosphate
synthetase (E.C.6.3.5.5)



D83984



Sulculus diversicolor
DNA for IDO-like
myoglobin, complete
cds



0.022 



Z77952



H.sapiens flow-sorted
chromosome 6
HindIII fragment,
SC6pA4A3



1078509 



0.022 



4204206 



probable membrane protein 
YDR018C- yeast 



(AB022786) N-acetyl-beta-D-
glucosaminidase [Enterobacter
sp.]



<NONE> 



9.8 



9.7 



7.5 






Nearest Neighbor (BlastN vs. Genbank)


Nearest Neighbor (BlastX vs. Non-Redundant Proteins)


SEQ
ID


ACCESSION


DESCRIPTION


P VALUE


ACCESSION


DESCRIPTION


P VALUE


781


M10217


Xenopus laevis
mitochondrial DNA,
complete genome.


0.022 


2145763 


B2I6B_C2_205 protein- 


f.j 


782 


M55147


Pea chloroplast
glyceraldehyde-3-
phosphate
dehydrogenase
(Gpb1) gene,
complete cds.


0.022


417308


PROBABLE HELICASE
MOT1 Mot1p is a probable
helicase essential for vegetative
growth on rich glucose medium
at 30 degree C; Swiss-Prot
Accession number P32333;
similar to S. cerevisiae RAD26
gene product: Swiss-Prot
Accession number P40352


4.2 


783 


X58839 


Acholeplasma virus 
MV-L1 DNA for 
complete circular 
genome 


0.022


3273189


(AB008757) subunit II of
c(o/b)3-type cytochrome c
oxidase [Bacillus
stearothermophilus]


4J 


784 


M26185 


Mouse c-myb
oncogene, exon 1 and
exon 2 (partial).


0.022


138592


VITELLOGENIN I
PRECURSOR (YOLK
PROTEIN 1)
>gi|72270|pir||VJFF1
vitellogenin I precursor
unnamed protein product
[Drosophila melanogaster]


2.5 


785 


AF061195 


Streptomyces albus 
valine dehydrogenase 
(Vdh) gene, complete 
cds 


0.022 


2088768 


(AF003145) B0414.8 gene
product [Caenorhabditis
elegans]


0.86 


786 


AF053622 


Homo sapiens alpha
1,2-mannosidase IB
gene, exon 9


0.022


1352361


EARLY GROWTH
RESPONSE PROTEIN 1 fish
>gi|531456 (U12895) egr1
[Danio rerio]


0.36 


787


Z71500


S.cerevisiae
chromosome XIV
reading frame ORF
YNL224c


0.022


1708875


PUTATIVE TUMOR
SUPPRESSOR LUCA15
[Homo sapiens]


0.16


788


D10471


Herpes simplex virus
type 2 genomic DNA
for 0.74-0.84 region,
complete cds


0.022


3132276


(AB0I14S6) short ORF [TT
virus]


0.13


789


U43082


Zea mays T
cytoplasm male
sterility restorer
factor 2 (rf2) mRNA,
complete cds


0.022


3319720


(AL031035) putative aldehyde
dehydrogenase [Streptomyces
coelicolor]


0.011



Nearest Neighbor (BlastN vs. Genbank)



Nearest Neighbor (BlastX vs. Non-Redundant Proteins)



SEQ
ID



ACCESSION



DESCRIPTION



P VALUE



ACCESSION



DESCRIPTION



P VALUE



790



X86913



H.sapiens simple
tandem repeat DNA
(clone wg3a6)



0.021



<NONE>



<NONE>



<NONE>



791



AF100694



Mus musculus
Pontin52 mRNA,
complete cds



0.021



<NONE>



<NONE>



<NONE>



792



U34016



Nannostomus sp.
large subunit rRNA
gene, mitochondrial
gene encoding
mitochondrial rRNA,
partial sequence.



0.021



<NONE>



<NONE>



<NONE>



793



X00845



Yeast mitochondrial
genes for 15S rRNA
and tRNA-Trp



0.021



<NONE>



<NONE>



<NONE>



794



AB012113



Homo sapiens gene
for CC chemokine
PARC precursor,
complete cds



0.021



<NONE>



<NONE>



<NONE>



795



U62395



Daucus carota
globulin-like protein
(Gea8) gene,
complete cds



0.021



<NONE>



<NONE>



<NONE>



796



M22718



P. falciparum actin II
gene, complete cds.



0.021



2623773



(AF004835) tyrocidine
synthetase 3 [Brevibacillus
brevis]



8.8



797



U27118



Arabidopsis thaliana
glutamyl-tRNA
reductase



0.021



3549885



(AJ006631) cysteine-rich
secretory protein-1 [Equus
caballus]



8.8



798



X99832



H.sapiens CLN3
gene, complete CDS



0.021



262249



(S52010) orf1 5' of EpoR [mice,
Peptide, 85 aa] [Mus sp.]



8.7



799



AF016266



Homo sapiens TRAIL
receptor 2 mRNA,
complete cds



0.021



729048



SUCCINYL-
COA:COENZYME A
TRANSFERASE transferase
[Clostridium kluyveri]



8.7



800



Z92541



Human DNA
sequence from PAC
179115, BRCA2 gene
region chromosome
13q12-13 contains
lactase-phlorizin
hydrolase (LCT)



0.021



585820



LIPOPOLYSACCHARIDE 1,2-
N-
ACETYLGLUCOSAMINETRANSFERASE
>gi|466761
(U00039) rfaK [Escherichia
coli] >gi|1790053 (AE000440)
probably hexose transferase;
lipopolysaccharide core
biosynthesis



5.3






Nearest Neighbor (BlastN vs. Genbank)


Nearest Neighbor (BlastX vs. Non-Redundant Proteins)


SEQ
ID


ACCESSION


DESCRIPTION


P VALUE


ACCESSION


DESCRIPTION


P VALUE


801


S58588


dopamine D2
receptor [human,
brain, Genomic, 379
nt, segment 4 of 5]


0.021


2677620


^ ivw-?; ^Ax^r^+j—arginine 
ADP-ribosyltransferase 
[Oryctolagus cuniculus]


5.1


802 


M60522 


Rat nerve growth
factor-inducible
protein (VGF) gene,
complete cds.


0.021


4103934


(AF030050) replication factor C
m [Rattus norvegicus]


3.1


803 


AF045654 


Gallus gallus
neuregulin beta-1a
mRNA, complete cds


0.021


2746829


(AF040647) No definition line
found [Caenorhabditis elegans]


3.0


804 


M69023 


Human globin gene. 


0.021 


3880259


(Z82056) T26H5.8
[Caenorhabditis elegans]
>gi|3880787|gnl|PID|e1350288
(AL032620) T26H5.8


2.4


805 


Z65960


H.sapiens CpG DNA,
clone 69d2, reverse
read cpg69d2.rt1b.


0.021


1707245


\%jQij<y*j) similar to family 1 of
G-protein coupled receptors
[Caenorhabditis elegans]


0.79


806 


X97073 


A.oligospora gene 
encoding lectin 


0.021 


116949 


CORE ANTIGEN
/jovi|pir||iNii. vllz core
antigen - woodchuck hepatitis
virus 2 >gi|336135


0.47


807 


X56491 


D. melanogaster 
mRNA for gene 
containing opa 
repetitive element 


0.021 


2842750 


HOMEOBOX PROTEIN DLX-7
>gi|1620520


0.16


808 


L7S760


Homo sapiens
(subclone 1 J6 from
P1 H31) DNA
sequence


0.021


113671


!!!! ALU CLASS F WARNING
ENTRY !!!!


0.15


809


AB007864


Homo sapiens
KIAA0404 mRNA,
partial cds


0.021


118144


CYSTEINE SYNTHASE A (O-
ACETYLSERINE
SULFHYDRYLASE A) (O-
ACETYLSERINE (THIOL)-
LYASE A) (CSASE A)
>gi|68323|pir||SYEBAC cysteine
synthase (EC 4.2.99.8) A -
Salmonella typhimurium
>gi|153935 (M21450) cysK
protein [Salmonella
typhimurium]


0.12


810


AL021932


Mycobacterium
tuberculosis H37Rv
complete genome;
segment 22/162


0.021


2909514


(AL021932) hypothetical
protein Rv0439c


7e-10








Nearest Neighbor (BlastN vs. Genbank) 


Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 


SEQ 
ID 


ACCESSION 


DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 
















811 


U89991


Hypocrea jecorina
mannose-1-phosphate
guanylyltransferase
(MPG1) mRNA,
complete cds


0.021


3581024


(AL031538) mannose-1-
phosphate guanyltransferase
[Schizosaccharomyces pombe]


6e-20 


812 


X00641 


Sugar beet 
mitochondrial 
minicircle pO 
sequence 


0.020 


<NONE> 


<NONE> 


<NONE> 


813 


Z50097


D.melanogaster 
mRNA for hdc 
protein. 


0.020 


<NONE> 


<NONE> 


<NONE> 


814 


AF044866 


Phoebis sennae large 
subunit ribosomal 
RNA gene, partial
sequence; tRNA-Val 
gene, complete 
sequence; and small 
subunit ribosomal 
RNA gene, partial 
sequence, 

mitochondrial genes 
for mitochondrial 
RNAs 


0.020 


<NONE> 


<NONE> 


<NONE> 


815 


AF074386 


Sambucus nigra 
hevein-like protein 
mRNA, complete cds 


0.020 


<NONE> 


<NONE> 


<NONE> 


816 


AF027174 


Arabidopsis thaliana 
cellulose synthase 
catalytic subunit (Ath- 
B) mRNA, complete 
cds 


0.020


<NONE> 


<NONE> 


<NONE> 


817 


AE001405 


Plasmodium
falciparum
chromosome 2,
section 42 of 73 of
the complete
sequence


0.020 


2196776 


(AF003342) bunched gene 
product [Drosophila 
melanogaster] 


8.4 


818 


AF074387 


Sambucus nigra 
hevein-like protein 
mRNA, complete cds 


0.020 


627071 


histidine-rich protein -
Plasmodium lophurae 


2.S 



